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ABSTRACT 

This  report  provides  the  equations  of  evolution  of  an  encounter  involving  a  teleoperated 
vehicle.  The  global  model  contains  interconnected  submodels  describing  the  conventional  exter¬ 
nal  primitives  of  the  encounter  (base  states),  suddenly  occurring  events  (feature  states),  and  a 
dynamic  description  of  the  remote  operator  (the  generalized  operator  model).  This  model  is 
phrased  as  a  set  of  stochastic  differential  equations  that  can  accommodate  both  linear  and  non¬ 
linear  effects.  The  final  section  of  the  report  places  these  results  within  the  context  of  the  multi- 
\  problem,  and  indicates  the  direction  of  future  research  which  will  yield  a  quantitative 
description  of  vehicle  performance  in  a  rapidly  changing  environment. 
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1.  INTRODUCTION 

This  report  presents  the  initial  results  of  an  investigation  seeking  to  determine  the  perfor- 
riiarire  capability  of  a  "pointing-and-tracking”  system  incorporating  a  teleoperated  vehicle  (TOV) 
when  the  remote  operator  is  required  to  perform  several  tasks  simultaneously.  Determining  per¬ 
formance  in  a  multitask  environment  is  significantly  more  complicated  than  it  is  in  the  single 
task  setting.  Not  only  must  the  encounter  dynamics  be  delineated  in  each  of  the  alternative 
modes  of  operation,  but  the  transition  properties  of  the  system  as  it  moves  between  tasks  must  be 
described  in  a  compatible  manner. 

An  encounter  involving  a  TOV  may  have  several  primitive  elements,  and  an  analytical 
description  is  required  for  each.  The  TOV  is  itself  an  electromechanical  device  with  lags,  gains, 
etc.  Its  primary  properties  are  conveniently  phrased  in  terms  of  a  set  of  ordinary  differential 
equations.  These  equations  relate  the  actuating  signals  arising  from  action  by  the  remote  opera¬ 
tor  to  the  dynamic  variables  of  the  TOV;  e.g.,  position,  velocity,  orientation,  etc.  To  tne  extent 
that  there  is  uncertainty  in  the  way  that  the  TOV  will  respond  to  direct  commands,  a  random 
forcing  may  be  included  in  the  dynamic  equation  at  the  TOV. 

The  tasks  which  engage  the  operator’s  attention  often  involve  objects  that  are  much  less 
predictable  than  is  the  TOV.  For  example,  if  the  operator  seeks  to  track  an  evasive  target, 
uncertainty  in  target  motion  reduces  the  incentive  to  utilize  a  high-order  dynamic  model  of  target 
evolution.  Instead,  a  simple  model  driven  by  a  large  amplitude  exogenous  process  is  a  more 
appropriate  description  of  target  behavior. 

Similarly,  if  the  TOV  is  required  to  follow  a  prespecified  path,  the  path  behavior  can  be 
phrased  in  terms  of  an  ordinary  differential  equation,  and  the  operator  causes  the  TOV  states  to 
match  the  associated  path  states.  When  the  path  becomes  less  predictable,  a  stochastic  model  of 
the  path  becomes  expedient. 
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Tke  use  of  stochastic  forcing  terms  is  an  attractive  way  to  quantify  the  uncertainty  that 
exists  in  how  an  encounter  will  evolve.  A  model  of  low  dimension  will  frequently  suffice  to 
describe  such  complex  dynamic  objects  as  tanks,  APCs,  etc.  Indeed,  the  more  uncertainty  that 
surrounds  the  motion  of  one  of  the  primitives,  the  lower  the  dynamic  order  needs  to  be. 

In  order  that  this  procedure  yield  useful  results,  the  behavior  of  the  model  should  duplicate 
that  observed  in  tests  of  the  object  being  modelled.  Frequently  such  empirical  data  is  phrased  in 
terms  of  a  power  spectral  density,  and  the  resulting  model  takes  the  form  of  a  simple  Gauss- 
Markov  process.  First  or  second  order  models  often  suffice  to  give  a  close  approximation  to  the 
power  spectral  density. 

While  this  spectrally-based  approach  is  justified  as  long  as  only  "linear-quadratic"  analysis 
is  required,  it  may  yield  a  model  whose  sample  function  behavior  bears  little  resemblance  to  those 
of  the  object.  A  linear  Gauss-Markov  model  has  continuous  sample  paths  of  great  local  volatil¬ 
ity.  Such  a  model  does  not  have  the  inherent  flexibility  required  to  produce  sample  functions 
having  discrete  changes  at  isolated  points  in  time.  For  example,  while  a  target  operating  in  a 
benign  environment  may  be  well  described  by  Gauss-Markov  process,  a  target  in  a  hostile 
environment  may  execute  evasive  maneuvers  that  involve  sudden  and  unpredictable  changes  in 
acceleration.  Another  example  of  a  discrete  event  which  abruptly  influences  the  evolution  of  an 
encounter  is  that  of  the  sudden  appearance  or  disappearance  of  a  target. 

Changes  of  the  type  described  above  can  be  thought  of  as  a  variation  in  the  mode  of  evolu¬ 
tion  of  the  encounter.  A  modal  descriptor  or  feature  fixes  the  equations  of  motion  which 
currently  govern  the  basic  system  elements.  The  advantage  of  a  TOV  is  that  it  injects  a  human 
intelligence  into  the  loop  dynamics.  The  human  operator  has  a  unique  ability  to  discern  the  fun¬ 
damental  features  of  a  time  varying  and  spatially  cluttered  sequence  of  images,  and  as  a  conse¬ 
quence,  the  control  becomes  contingent  on  the  observed  feature  process. 

While  it  is  desirable  to  construct  a  model  that  matches  the  behavioral  qualities  of  the  exter¬ 
nal  portion  of  the  encounter  as  experienced  by  the  operator  of  the  TOV,  it  is  essential  that  the 
method  of  description  be  simple  in  delineation,  and  lead  to  tractable  analytical  problems.  By 
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"simple  in  delineation"  it  is  meant  that  the  model  has  few  parameters,  and  that  these  parameters 
correspond  to  readily  identifiable  properties  of  sample  behavior.  Note  that  these  two  attributes 
do  not  always  occur  together. 

In  order  to  lead  to  "tractable  analytical  problems,"  the  model  type  must  have  a  well  struct 
tured  "calculus"  or  rules  of  manipulation.  Transfer  function  models  are  in  this  category,  as  are 
the  conventional  Gauss- Markov  models;  but  neither  of  these  model  types  is  sufficiently  compliant 
to  permit  the  inclusion  of  the  relevant  feature  variation. 

In  this  report,  the  elementary  constituents  of  the  encounter  will  be  described  by  stochastic 
differential  equations.  Such  equations  have  the  requisite  properties  noted  above.  Further,  the 
conventional  transfer  function  and  Gauss-Markov  models  are  special  versions  of  a  model  given  in 
terms  of  a  stochastic  differential  equation.  As  with  the  more  prosaic  models,  addition  or  deletion 
of  elementary  components  of  the  encounter  is  easily  accomplished  by  changing  the  order  of  the 
model.  Further,  in  contrast  to  the  previous  models,  it  is  possible  to  investigate  both  linear  and 
nonlinear  operation. 

The  ingredient  that  gives  a  system  containing  a  TOY  its  unique  character  is  the  remote 
operator  himself.  The  remote  operator  provides  the  intermediary  through  which  the  disparate 
elements  are  coupled.  To  provide  a  complete  encounter  model  then,  the  operator’s  dynamic 
behavior  must  be  quantified.  Clearly,  the  operator  model  must  satisfy  the  desiderata  outlined 
above,  and  it  must  be  compatible  with  the  model  of  the  external  portions  of  the  encounter. 

Operator  models  are  fundamentally  more  diverse  than  are  models  of  electromechanical 
objects.  The  human  is  capable  of  so  many  dissimilar  patterns  of  action  that  it  is  quite  difficult  to 
capture  all  of  his  attributes  in  a  single,  simple  model.  Yet,  an  approximation  to  such  a  model  is 
required  to  complete  the  overall  system  description. 

The  issue  of  selecting  a  suitable  operator  model  is  of  primary  concern  in  this  report.  This  is 
a  topic  with  an  interesting  history,  and  it  is  explored  in  much  more  detail  in  the  next  section. 
Suffice  it  to  say  here  that  there  is  a  convenient  trichotomy  of  human  action  based  upon  the  time 
scale  or  planning  horizon  of  the  activity.  Models  based  upon  this  trichotomy  are  referred  to  as 
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knowledge  based  (long  horizon),  rule  based  (intermediate  horizon)  or  skill  based  (short  horizon) 
depending  upon  the  type  of  activity  which  is  being  modelled. 

The  first  two  of  the  above  model  classes  require  an  accurate  description  of  the  operator's 
behavior  in  relatively  ambiguous  environments  Such  models  are  quite  difficult  to  construct,  and 
even  more  difficult  to  verify  since  the  development  of  a  suitable  experimental  protocol  is  a  for¬ 
midable  task. 

The  third  model  category  is  most  applicable  to  the  work  reported  here.  These  short  horizon 
models  are  also  called  reflexive.  This  latter  appellation  will  be  used  here  because  it  more  clearly 
identifies  the  reactive  nature  of  an  operator  engaged  in  the  type  of  tasks  being  studied.  Reflexive 
models  are  based  upc  the  assumption  that  the  trained  operator  is  functioning  in  a  familiar 
environment,  and  has  a  well  defined  objective  or  objectives.  This  is  a  situation  often  encountered 
in  pointing-and-tracking  tasks.  The  resulting  model  includes  both  response  delays  and  the  clutter 
suppression  that  are  inherent  in  the  operator  response  to  external  events.  Such  models  are 
pseudo  input-output  models  in  the  sense  that  the  input  to  the  operator  model  is  a  derived  vari¬ 
able  which  is  not  the  neurological  stimulus  the  operator  actually  receives.  In  this  sense  the 
operator’s  attributes  precede  the  specific  operator  model  in  the  system  description. 

Reflexive  operator  models  have  been  successfully  used  in  several  pointing-and-tracking  sys 
terns.  As  will  be  detailed  in  the  sequel,  transfer  function  models  are  useful  in  stationary  environ¬ 
ments.  and  Gauss-Markov  models  in  nonstationary  environments.  The  parameters  of  the  former 
are  frequently  obtained  empirically,  while  the  latter  may  be  based  on  an  additional  calculation: 
e.g.,  the  minimization  of  a  performance  functional. 

Both  of  the  conventional  model  types  -  transfer  function  and  Gauss-Markov  -  satisfy  the 
requisite  properties  of  simplicity  and  analytical  compatability  with  the  externa)  portion  of  the 
encounter  state,  I'nfortunately  neither  readily  admits  the  feature  variation  so  much  a  part  of 
the  TOV  application.  Feature  dependence  in  the  operator  model  can  be  introduced  on  an  ad  hoc 
basis  by  indexing  a  set  of  stationary  models  to  the  external  features.  While  this  is  a  satisfactory 


-  6- 


approach  when  the  feature  changes  are  infrequent  and  unambiguous,  it  does  not  suffice  for  the 
situations  of  most  interest  here. 

This  report  develops  a  reflexive  model  of  the  remote  operator  of  a  TOV.  This  model  is 
expressed  as  a  (nonlinear)  stochastic  differential  equation,  and  thus  fits  naturally  with  the  other 
encounter  primitives  to  yield  the  total  system  model.  The  development  of  the  model  follows  the 
approach  used  in  creating  the  optimal  control  model,  but  generalizes  this  earlier  work  by  assum¬ 
ing  that  the  features  that  determine  the  realized  time  evolution  of  the  encounter  are  both  random 
and  time  varying. 

Permitting  the  system  equations  to  dep)end  upon  a  changing  mode  of  operation  aids  in  the 
investigation  of  multitask  op>eration  of  the  TOV.  Indeed,  as  an  initial  approximation,  multitask 
operation  can  be  placed  within  the  framework  created  in  this  report  by  simply  making  the 
operator’s  observation  of  the  scene,  situation  dep>endent.  The  discrete  nature  of  task  selection  fits 
naturally  within  the  bounds  of  the  generic  encounter  model. 

The  next  section  provides  a  review  of  the  operator  models  which  have  proven  useful  in  other 
applications  with  an  emphasis  on  reflexive  models.  Section  3  develops  the  operator  model  of 
choice.  It  is  a  nonlinear  dynamic  equation  that  is  responsive  to  feature  variation.  The  model  is 
not  complete  in  a  sense  made  precise  in  Section  3,  and  Section  4  provides  a  useful  approximation 
that  overcomes  this  deficiency  i"  part.  Section  5  reviews  the  development  as  well  as  providing 
suggestions  as  to  how  the  model  should  be  completed,  and  some  implications  of  the  model  charac¬ 
teristics. 

With  the  full  encounter  model  discussed  above,  it  is  now  possible  to  begin  a  study  of  multi¬ 
task  operation  of  the  TOV.  Because  there  is  no  way  to  verify  the  model  parameters  at  present, 
the  full  system  description  provided  here  should  be  thought  of  as  a  preliminary  approximation  to 
actual  system  behavior.  Nevertheless,  a  careful  study  of  this  model  can  be  expected  to  yield  use¬ 
ful  indications  of  performance  sensitivities  and  limitations. 
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2.0  MODELING  THE  HUMAN  OPERATOR 

A  systematic  analysis  of  the  performance  of  the  teleoperated  vehicle  (TOV)  requires  a  model 
for  the  entire  system.  The  model  must  include  a  mathematical  description  of  all  elements  which 
significantly  influence  preselected  measures  of  performance.  From  a  top-down  pioint  of  view, 
important  elements  are  the  vehicle  dynamics,  the  characteristics  of  the  environment  (e.g.  terrain 
and  target  dynamics),  the  observation  system  (stereoscopic  video  and  stereophonic  audio  at 
present)  and  the  human  operator’s  control.  In  this  section,  methods  for  quantitatively  describing 
human  performance  of  representative  TOV  piloting  tasks  are  investigated. 

Section  A  discusses  general  man-machine  systems  and  provides  an  introduction  to  the  issues 
involved  in  modeling  such  systems.  A  hierarchy  of  human  functions,  organized  to  reflect  the 
degree  of  intellectual  involvement  required,  is  identified  based  on  the  formalism  suggested  in  pre¬ 
vious  works  by  psychologists  and  systems  theorists  In  this  framework,  the  role  of  the  TOV 
opierator  is  assessed  and  the  features  of  the  TOV  environment  which  shap>e  it  are  identified. 

In  a  typical  TOV  mission,  the  remote  operator  drives  the  vehicle,  monitors  the  local 
environment  and  responds  to  anticipated  stimuli  in  a  prespecified  fashion.  The  local  environment 
is  unpredictable;  it  may  include,  for  example,  path  obstacles,  sharp  turns  and  multiple  targets. 
The  high  quality  of  the  operator’s  visual  information  (stereoscopic  video  transmitted  via  fiber 
optical  link)  enables  him  to  discern  these  features  and  resijond  to  them.  The  result  is  a  problem 
which  must  integrate  existing  manual  control  results  with  a  model  for  the  human’s  detection  and 
response  capabilities.  A  historical  summary  of  relevant  manual  control  theory  is  therefore 
presented  in  Section  B.  The  review  is  followed  by  a  detailed  description  of  those  models  which 
have  received  the  most  exposure  and  enjoyed  the  greatest  success  in  application  to  7'OV-related 
tasks  (e  g  driving,  target  tracking).  In  particular,  quasi-linear  describing  function  models  (DFM) 
and  the  Optima!  Control  Model  (OCM)  are  presented.  The  emphasis  in  this  section  is  on  previ¬ 
ous  research  and  its  relevance  to  the  manual  control  aspiects  of  TOV  operation. 
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A.  Hierarchy  of  Human  Operator  Activities 

A  theoretical  framework  within  which  the  teleoperated  vehicle  can  be  systematically 
analyzed  would  complement  the  ongoing  experimental  testing  of  the  TOV.  For  this  purpose,  a 
mathematical  model  of  the  composite  man-machine-environment  system  is  being  developed.  Pos¬ 
sible  uses  for  this  model  include; 

•  the  assessment  of  current  system  performance  capabilities  in  multiple  environmental  or 
tracking  scenarios; 

•  the  evaluation  of  the  sensitivity  of  the  TOV  to  various  system  parameters; 

•  the  assessment  of  the  usefulness  of  system  enhancements  such  as  diiver  aids  or  addi¬ 
tional  sensors  (e.g.  motion  sensors). 

The  ultimate  use  of  the  system  model  is  an  important  consideration  in  the  selection  of  a  modeling 
methodology.  For  the  present  aspect  of  TOV  analysis,  the  emphasis  is  on  the  gross  system  per¬ 
formance  as  a  result  of  the  interaction  of  all  system  elements,  and  not  on  the  individual  perfor¬ 
mance  of  any  single  component.  The  submodels  are  thus  selected,  for  a  specific  system 
configuration,  to  reflect  the  influence  of  each  component  on  the  overall  success  or  failure  of  the 
system.  The  submodel  of  interest  here  is  that  of  the  human  operator.  Its  development  requires  a 
careful  evaluation  of  the  role  of  the  human  operator,  a  precise  definition  of  the  tasks  he  must  per¬ 
form,  and  an  evaluation  of  the  importance  of  each.  A  review  of  some  perspectives  on  human  task 
analysis  provides  a  framework  within  which  these  issues  can  be  addressed. 

Johannsen  [1982]  identifies  all  human  tasks  as  falling  in  one  of  two  categories,  controlling  or 
problem  solving.  Tasks  in  the  flrst  category  include  classical  continuous  control  tasks  as  well  as 
any  other  action  oriented  activities  which  produce  system  outputs.  Models  for  these  tasks  are 
numerous  and  well  documented.  Some  are  presented  in  the  next  subsection.  The  second  category 
of  tasks,  the  problem  solving  tasks,  tend  to  be  internally  rather  than  physically  demanding  and 
require  a  higher  level  of  cognitive  involvement.  Examples  include;  the  formulation  and 
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modification  of  plans;  the  assessment  of  alarm  situations;  and  the  development  and  initiation  of 
control  procedures  to  combat  such  situations.  Problem  solving  tasks  generally  involve  the 
development,  modification  and  utilization  of  the  broad  knowledge  base  characteristic  of  human 
beings.  Models  for  these  tasks  are  much  more  difficult  to  develop  since  they  must  necessarily 
include  many  of  the  psycho-social  factors  which  govern  human  behavior.  Such  factors  are 
difficult  to  describe  in  the  quantitative  manner  generally  desired  by  mathematical  modelers. 

In  ah  attempt  to  bridge  the  gap  which  existed  between  modelers  who  were  addressing 
specific  manual  control  problems  and  others  who  viewed  human  behavior  from  a  more  broad 
psychological  perspective,  Johannsen  and  Rouse  [1978]  proposed  a  framework  within  which 
human  activities  could  be  organized.  Their  hierarchical  perspective,  illustrated  in  Figure  2.1,  is 
amenable  to  a  quantitative  computer-like  interpretation  of  human  functions,  but  at  the  same 
time  accounts  for  higher  level  psychological  and  intellectual  activities  such  as  reflecting  and  plan¬ 
ning.  At  the  lower  level  of  the  diagram,  the  activities  correspond  to  essentially  automatic 
behaviours.  In  highly  trained  operators,  such  behaviors,  once  learned,  become  reflex-like  and  are 
probably  performed  at  the  level  of  the  cerebellum.  Johannsen  and  Rouse  point  out  that  events 
which  necessitate  these  activities  tend  to  occur  more  frequently  than  those  which  activate  higher 
level  processes.  The  implication  is,  however,  that  although  the  time  horizons  involved  in  the 
low-level  processes  are  much  shorter,  they  are  not  considered  more  frequently  by  the  high  level 
processor  (in  this  case  the  cerebrum  with  its  enormous  knowledge  base  and  reasoning  capabili¬ 
ties).  In  fact,  they  are  viewed  as  essentially  autonomous.  The  authors  draw  an  illustrative  anal¬ 
ogy  to  a  time  sharing  computer  system  in  which  certain  programs  are  executed  by  peripheral  dev¬ 
ices  and  rarely  require  intervention  by  the  operating  system. 

Johannsen  and  Rouse  additionally  acknowledge  that  human  planning  itself  can  be  viewed  as 
a  hierarchical  procedure.  At  the  highest  level,  a  broad  plan  is  developed  to  address  major  goals. 
This  plan  may  not  involve  specific  activities  or  require  immediate  implementation.  As  it  is  exe¬ 
cuted,  however,  goals  are  partitioned  into  lower  and  lower  subgoals.  Eventually  the  times 
between  subgoal  identification,  corresponding  plan  development,  and  ultimate  plan  execution 
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Figure  2.1:  Hierarchical  Multi-Level  Structure  of  Human  Activities 

become  very  short.  The  authors  assert  that  at  this  point  the  planning  is  probably  uneorueiaiu 
and  that  a  concise  system  dynamics  model,  in  terms  of  quantitative  state  transitions,  "probably 
provides  a  reasonable  description  of  human  behavior.”  This  issue  is  discussed  again  later  in  this 
subsection  with  reference  to  the  autopilot  of  Nitao  and  Parodi. 

Rasmussen  [1980]  continued  the  trend  toward  a  hierarchical  representation  of  human  con¬ 
trolling  and  problem  solving  behaviors  with  the  more  precise  breakdown  illustrated  by  the  block 
diagram  in  Figure  2.2  (Adopted  from  Phatak  [1983]).  He  phrases  the  behaviors  commonly 
identified  in  the  field  of  behavioral  psychology  in  system  theoretic  terms.  The  interesting  feature 
of  Rasmussen's  structure  is  the  inclusion  of  "shunts”  or  shortcuts  which  allow  the  bypass  of 
unneeded  intellectual  processes  as  the  state  of  the  system  demands.  The  path  indicated  by  the 
first  shunt  represents  the  lowest  level  of  human  activity,  what  Rasmussen  calls  "skill  based" 
behavior.  This  type  of  behavior  is  exhibited  in  situations  where  the  operator  is  familiar  with  the 
observed  state  of  the  system;  the  features  be  observes  have  been  experienced  before  and  evoke  an 
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-  Visual 

-  Vestibular 

-  Auditory 


Figure  2.2:  Functional  Tasks  Performed  by  the  Human. 


immediate  response.  As  an  example,  an  experienced  driver  continuously  and  automatically 
adjusts  his  steering  and  acceleration  to  maintain  his  desired  position  on  the  road.  Even  in  the 
event  of  an  abrupt  curve  in  his  path,  his  response  is  essentially  automatic. 

The  path  indicated  by  the  second  shunt  represents  the  execution  of  "rule  based"  behaviors. 
These  involve  higher  cognitive  facilities  than  do  skill  based  behaviors  because  a  conscious  assess¬ 
ment  of  the  situation  is  required  prior  to  the  initiation  of  an  appropriate  action  or  procedure. 
However,  once  the  situation  is  assessed,  the  appropriate  action  is  assumed  clear.  Rule-based 
behaviors,  therefore,  do  not  require  the  intellectual  capacities  involved  in  the  on-line  development 
of  new  plans  to  handle  unanticipated  situations. 

Finally,  the  highest  level  of  human  behavior,  indicated  by  the  path  without  shunts  in  Figure 
2.2,  Rasmussen  terms  "knowledge  l>ased"  behavior.  At  this  level,  the  human  utilizes  his  basic 
knowledge  of  the  system  and  accumulated  expertise  to  generate  plans  and  procedures  to  be  used 
in  the  accomplishment  of  major  goals.  Due  to  the  lack  of  detailed  information,  these  plans  may 
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be  broad  and  sketchy,  perhaps  subject  only  to  vague  verbal  interpretation.  Behaviors  of  this 
type,  are  generally  exhibited  in  the  presence  of  previously  unexperienced  circumstances.  Such 
situations  are  necessarily- characterized  by  a  low  frequency  of  occurrence. 

Rasmussen’s  human  behavioral  structure  of  Figure  2.2  can  be  included  as  the  feedback  ele¬ 
ment  in  a  closed  loop  syst-em.  The  result  is  the  configuration  of  nested  feedback  loops  illustrated 
in  Figure  2.3.  Consistent  with  the  previous  observation  that  changes  in  the  system  which  require 
global  replanning  are  infrequent  (or  at  least  slow  to  evolve),  the  outer  (knowledge-based)  plan¬ 
ning  loop  has  a  long  time  constant.  Similar  time  scale  interpretations  apply  to  the  inner  loops. 

In  their  work  on  an  autonomous  land  vehicle  (ALV),  Nitao  and  Parodi,  jl985|  take  the  ideas 
of  "frequency  of  critical'  events"  and  the  associated  time  scale  interpretations  one  step  further. 
They  propose  a  hierarchy  of  autopilot  functions  which  are  in  fact  characterized  in  terms  of  the 
time  horizons  involved  in  the  feedback  loops.  Although  these  functions  are  jaerformed  by 
hardware  and  software  modules  rather  than  a  human,  the  authors’  analytical  perspective  on  the 
functions  required  to  drive  a  vehicle  in  an  uncertain  and  cluttered  environment  provides  a  frame¬ 
work  within  which  the  role  of  the  human  teleoperator  can  be  analyzed. 

Figure  2.4  illustrates  the  space-time  hierarchy  of  the  ALV  piloting  functions.  In  the  outer 
loop,  the  "Planner"  software  uses  a  broad  view  of  the  world  (e.g.  terrain  and  elevation  informa¬ 
tion)  to  generate  a  global  plan.  An  example  is  "proceed  along  the  road  until  landmark  ‘x’  is 
passed,  then  ...".  Such  a  plan  is  developed  initially,  and  due  to  its  generality,  requires  revision 
only  in  the  event  of  a  drastic  change  in  the  world  view.  Such  changes  are  assumed  to  occur  either 
infrequently  or  very  slowly.  The  loop  is  characterized  by  a  time  horizon  on  the  order  of 
T  >  10^  sec,  presumably  on  the  order  of  that  of  the  entire  mission.  For  representative  TOV  mis¬ 
sions  of  the  type  considered  in  the  present  analysis,  it  is  assumed  that  the  type  of  planning 
modeled  by  this  slow  outer  loop  is  performed  a  prion.  Thus,  the  overall  online  effect  of  this  loop 
is  eliminated.  Missions  in  which  high-level  know  ledge- based  planning  behavior  influence  the  per¬ 
formance  are  considered  anomalous.  Examples  of  situations  not  considered  here  are: 


Control  Action 


Figure  2.3;  Function.s  of  tlie  Humnn  Oi)rrator  (Closed  Loop) 
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Figure  2.4:  Space-Time  Relationship  for  a  Hierarchical  Planning  System 
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•  loss  of  brakes  or  other  mechanical  failures; 

•  drastic  terrain  alteration  caused  by  an  event  on  the  scale  of  a  natural  disaster; 

•  a  change  in  overall  strategic  objective. 

At  the  intermediate  level  of  Figure  2.4,  a  software  module  called  the  "Observer"  interprets 
the  general  plan  in  light  of  additional  information  collected  en  route  by  the  on  board  sensors. 
The  situation  is  assessed  and  the  abstract  plan  converted  into  a  concrete  form.  The  result  is  a 
feasible  path,  specified  in  coordinates  meaningful  to  the  pilot  and  sensors,  which  extends  into  the 
visible  future.  A  localized  version  of  the  path  is  sent  to  the  "mapmaker"  which  generates  a 
detailed  map  of  the  path  in  the  immediate  vicinity  of  the  vehicle.  The  map  includes  the  broad 
path  borders,  sensor  visibility  limits,  and  obstacle  information.  The  latter  are  generated  by  a 
sonic  imaging  sensor.  This  map  and  additional  vehicle  velocity  data  are  sent  to  the  next  loop, 
the  functional  component  of  which  is  called  the  "reflexive"  pilot. 

It  is  the  role  of  the  reflexive  pilot  to  guide  the  vehicle  along  a  dynamically  feasible  route 
within  the  planned  path  while  avoiding  previously  undetected  obstacles.  In  Johannser’s  termi¬ 
nology  the  pilot  must  "execute"  the  plan  passed  down  from  the  observer.  This  involves  two  dis¬ 
tinct  levels  of  processing.  In  the  higher  of  the  two,  the  reflexive  pilot  utilizes  the  detailed  local 
map  generated  by  the  mapmaker  to  formulate  and  select  possible  subgo2ds.  At  this  level, 
subgoals  are  defined  as  feasible  directions  in  which  the  vehicle  could  proceed  so  as  to  stay  on  ti.c 
path,  avoid  nearby  obstacles  and  make  progress  in  the  overall  goal  direction.  One  of  these 
subgoals  is  then  selected  based  on  a  weighting  of  the  factors  above  and  vehicle  dynamics.  In  this 
sense,  the  reflexive  pilot  is  a  low  level  planner.  The  constraints  that  govern  its  activity,  however, 
are  so  strict  (temporally  and  spatially)  that  straight  forward  mathematical  algorithms  perform 
the  subgoal  generation  and  selection  tasks  adequately.  This  activity  is  comparable  to  the  low 
level  planning  Johannsen  and  Rouse  described  as  "unconscious"  and  "automatic"  in  the  human 
being.  The  time  horizons  associated  with  this  reflexive  planning  loop  are  on  the  order  of  1  sec. 
This  is  consistent  with  both  the  relatively  high  frequency  of  events  which  change  the  relevant 
local  view  (e.g.  appearance  of  an  obstacle,  movement  of  an  obstacle  due  to  erroneous  sensing,  or 
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appt-arance  of  a  curve  in  the  road),  and  the  need  for  rapid  control  action  to  respond  to  these 

fcai  ures. 

The  inner  loop  in  Figure  2.4  contains  the  control  algorithms  and  actuators.  Its  primary 
functions  are  to  respond  to  small  perturbations  in  the  desired  path  (i.e.  to  reduce  noise),  and  to 
execute  control  commands  generated  by  the  reflexive  pilot.  The  loop  is  characterized  by  a  very 
short  lime  scale  (r<.l  sec).  The  functions  performed  in  this  loop  are  on  the  approximate  level  of 
those  modeled  by  a  vast  majority  of  manual  control  algorithms. 

In  light  of  the  development  above,  the  role  of  the  teleoperator  in  representative  TOV  mis¬ 
sions  is  now  made  more  explicit.  Assumptions  about  the  mission,  tasks,  human,  and  environ¬ 
ment  which  determine  the  structural  requirements  of  the  operator  model  are  presented. 

As  mentioned  above,  it  is  assumed  that  a  global  plan  for  operation  is  developed  a  priori  and 
that  for  the  current  analysis,  online  revisions  on  the  part  of  the  operator  are  not  required.  Execu¬ 
tion  of  this  plan  is  assumed  to  involve  such  manual  control  tasks  as  traversing  a  smooth  or  tortu¬ 
ous  path,  or  tracking  a  target  for  the  purpose  of  identification,  designation  or  eventual  weapons 
release.  Simultaneous  activities  may  be  required.  The  human  operator  is  assumed  to  be  well 
trained  in  accomplishing  the  relevant  tasks;  he  is  familiar  with  the  dynamics  of  the  vehicle  and 
control  system,  and  has  performed  similar  tasks  before. 

Under  these  assumptions,  the  human's  behavior  and  its  effect  on  the  system,  are  character¬ 
ized  by  the  nature  of  the  task  environment.  His  behavior  largely  reflects  properties  of  the 
environment  in  light  of  the  curent  goals  [Newell  and  Simon  (1972)].  For  example,  when  the  road 
is  relatively  straight  and  the  tracked  target  is  well-defined  and  exhibits  only  benign  or  predictable 
maneuvers,  the  human's  control  behavior  consists  of  simple  automatic  responses  which  are  well 
modeled  in  concise  control  theoretic  terms.  Given  a  goal  and  svstem  constraints,  the  control 
methodology  is  relatively  unambiguous  and  the  operator's  primary  function  is  that  of  noise 
reduction.  His  function  is  characterized  by  the  inner  most  loop  in  Figure  2.4. 
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The  TOV  pilot,  however,  is  expected  to  perform  the  types  of  behaviors  discussed  above  in  a 
natural  environment  composed  of  multiple,  ambiguous  stimuli  which  can  change  from  moment  to 
moment.  Examples  are: 

•  sharp  turns  in  the  road  or  steep  grades; 

•  the  appearance  or  disappearance  of  obstacles  or  targets; 

•  sudden  changes  in  target  acceleration  or  orientation. 

A  specific  combination  of  any  of  these  we  call  a  “feature”  of  the  environment.  For  the 
present  analysis,  we  assume  that  the  features  of  interest  to  the  driver  can  be  enumerated,  and 
that  he  has  a  notion  of  how  they  might  evolve.  Details  are  presented  in  Section  3. 

The  properties  of  the  environment  described  above  can  be  compared  to  those  which  are 
input  to  the  reflexive  pilot  module  in  the  hierarchy  of  Nitao  and  Parodi.  They  are  characterized 
by  a  high  frequency  of  transition  relative  to  the  time  scale  of  the  mission.  In  the  ALV  autopilot, 
the  mapmaker  generates  the  local  detailed  map  which  the  reflexive  pilot  uses  to  define  its  control 
behavior.  In  the  teleoperated  vehicle,  the  human  performs  both  these  tasks.  His  vision  enables 
him  to  generate  a  map  of  the  immediate  vicinity  of  interest.  In  this  case  the  "map”  includes  esti¬ 
mates  of  features  and  their  uncertainties,  and  the  "vicinity  of  interest”  may  be  in  his  path  or  that 
of  a  target.  Based  on  the  map,  the  teleoperator  performs  the  role  of  the  reflexive  pilot;  that  is,  he 
generates  vehicle  or  tracking  control  commands  which  are  res{x>nsive  to  features  in  his  map.  As 
in  the  familiar  case  of  an  experienced  driver  who,  when  he  encounters  a  turn  in  the  road, 
automatically  adjusts  his  steering  and  acceleration,  it  is  assumed  that  due  to  experience  and 
training,  the  teleoperator’s  response  requires  little  reflection  and  is  essentially  automatic. 

Recall  the  comparison  of  reflexive  planning  as  performed  by  the  ALV  autopilot  to  skill 
based  behavior  in  the  human  being.  In  both  cases,  a  connection  was  made  between  the  time  hor¬ 
izons  of  changes  in  the  world  view,  and  the  level  of  automation  of  the  response.  In  the  human  it 
was  proposed  that  such  behaviors  are  triggered  at  the  level  of  the  cerebellum  and  that  models  for 
these  behaviors  can  be  developed  without  taking  into  account  the  psycho-social  aspects  of 
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humanity.  In  the  ALV  autopilot,  sequential  software  algorithms  executed  in  real  time  proved 
¥ 

adequate  for  the  accomplishment  of  this  type  of  function. 

From  the  perspective  of  the  hierarchies  presented  in  this  section,  it  is  postulated  that  the 
majority  of  human  behavior  exhibited  during  the  execution  of  typical  TOV  missions  is  "reflexive" 
or  "skill  based".  Although  in  a  human  these  classifications  can  never  be  absolute  or  distinct,  the 
implications  associated  with  them,  recast  the  general  problem  in  a  more  tractable  form.  With 
reference  to  this  somewhat  restricted  view  of  the  TOV  mission  and  teleoperator’s  role,  a  model 
for  the  system  is  proposed.  The  model  accounts  for  a  higher  level  of  human  control  behavior 
than  most  earlier  models.  The  increment  is  illustrated  by  the  outer  loop  of  the  reflexive  pilot.  In 
the  next  subsection  earlier  models  are  reviewed.  In  Section  3,  the  present  model  is  developed. 
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B.  Review  of  Human  Controller  Modelling  Theory 

The  use  of  mathematical  modeling  as  a  tool  for  the  analysis  Df  manned  system  performance 
has  been  the  subject  of  considerable  research  for  the  past  40  years.  As  a  function  of  the  needs, 
point  of  view  and  background  of  the  researcher,  models  have  been  developed  based  on  physiology 
[Johannsen  (l971)j,  psychology  ISiegel  and  Wolf  (1969)j,  cognitive  science  [Newell  and  Simon 
(1972)]  and  systems  theory  [Tustin  (1947),  McRuer  and  Krendel  (1959),  Kleinman,  and  Levison 
(1969)1. 

Much  of  the  impetus  behind  modern  manual  control  research  came  from  the  pioneering 
work  of  feedback  control  engineers  during  and  immediately  following  World  War  II.  Tustin 
1 1947]  was  among  the  first  to  compare  the  control  behavior  of  a  human  to  that  of  an  inanimate 
feedback  device,  thus  laying  the  groundwork  for  what  has  come  to  be  known  as  the  control 
theoretic  approach  to  human  performance  modeling.  The  earliest  research  was  dictated  by  the 
development  of  complex  weapons  systems  (e.g.  power  driven  guns);  more  recently  models  for  air¬ 
craft  piloting  [McRuer  and  Graham  (1963),  Kleinman  and  Killingsworth  (1974)],  ship  piloting 
[Veldhuyzen  and  Stassen  (1977)],  automobile  steering  and  following  [McRuer  and  Weir  (1969), 
(1977),  Bekey  et  al.  (1977)],  and  modern  artillery  system  operation  [Phatak  et  al.  (1977),  Klein¬ 
man  (1981)1  have  been  developed  based  on  the  control  theoretic  perspective. 

The  driving  factor  in  the  investigations  referenced  above  was  the  existence  of  a  technological 
system  which  could  only  operate  in  concert  with  a  human  being  acting  in  a  manual  control  capa¬ 
city.  Overall,  the  approach  has  been  particularly  successful  in  quantitatively  modeling  human 
performance  in  tasks  which  involve  rapidly  responding  systems  with  severe  constraints  on  human 
performance.  For  these  systems,  the  models  have  been  successful  largely  because  the  operator  is 
faced  with  a  task  which  demands  his  constant  attention  and  response,  and  allows  little  reflective 
thought.  Consequently,  his  performance  is  dominated  by  his  control  behavior  rather  than  his 
reasoning  powers  or  problem  solving  capabilities.  The  TOV  system,  represents  a  version  of  such 
a  system  and  is  thus  a  candidate  for  application  of  control  theoretic  methodologies. 
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Briefly,  the  TOV  system  is  unlike  previous  applications  in  that  it  operates  in  a  an  unpredict¬ 
able  environment  that  exhibits  various  features  which  the  teleoperator  tries  to  identify.  In  the 
rase  that  his  detection  of  these  features  enhances  his  ability  to  p>erform,  his  role  is  no  longer  that 
of  a  simple  controller  performing  a  single  manual  control  task  in  a  predictable  yet  noisy  environ¬ 
ment.  He  must  also  adapt  his  response  to  those  environmental  features  which  may  result  in 
changes  in  the  system  dynamics.  The  key  observation,  moreover,  is  that  his  behaviors  remain  in 
the  class  of  "reflexive"  behaviors  described  in  section  2. A,  and  his  primary  responsibilities  in  the 
realm  of  manual  control  (i.e.  vehicle  guidance  and/or  target  tracking).  In  particular,  it  is 
assumed  that  as  he  becomes  aware  of  features  in  the  environment  he  reacts  according  to  some 
predetermined  plan  or  set  of  rules.  That  is,  he  has  experienced  the  feature  before  and  his  response 
is  essentially  automatic. 

In  the  remainder  of  this  section,  a  discussion  of  the  perspectives  underlying  the  control 
theoretic  approach  and  a  review  of  the  most  popular  models  which  have  resulted  are  presented. 
It  is  concluded  with  a  critical  appraisal  of  their  utility  in  modeling  the  TOV  pilot. 
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The  Control  Theoretic  Approach  to  Human  Operator  Modeling 

The  generic  block  diagram  for  a  manual  control  system  is  shown  in  Figure  2.5  A  few  obser¬ 
vations  illustrate  several  perspectives  which  characterize  the  control  theoretic  approach  to  human 
operator  modeling.  First,  the  primary  goal  is  a  model  which  is  useful  for  predicting/analyzing 
total  system  performance.  The  human  is  viewed  simply  as  one  of  several  system  elements,  the 
input/  output  behavior  of  which  must  be  mathematically  described  in  order  that  the  performance 
of  the  integrated  man-machine-environment  system  may  be  analytically  investigated.  Note  that 
the  existence  of  compatible  models  for  the  direct  task  environment  as  well  as  the  controlled  ele¬ 
ment  are  thus  implicitly  assumed.  With  this  approach,  the  analysis  begins  with  system  con¬ 
siderations,  (e.g.  task  goals  and  human  limitations)  rather  than  a  direct  analysis  of  the  human 
element.  The  human  is  modeled  from  a  functional  or  behavioral  standpoint  rather  than  by  the 
more  traditional  approach  in  which  his  performance  is  synthesised  from  a  sequence  of  models  for 
elementarv  physiological,  neurological  and/or  cognitive  activities  (e.g.  eyeball  motions,  knob 
turns,  memory  recalls).  The  resulting  models  tend  to  be  less  task  specific  than  those  previously 
obtained. 

.Another  idea  underlying  the  control  theoretic  approach  is  the  characterization  ot  the  human 
as  an  element  in  a  feedback  loop  who  correspondingly  adopts  characteristics  such  that  the  closed 
loop  system  dynamics  approximate  those  of  a  "good"  feedback  system.  The  exact  definition  of 
"good"  is  of  course  dependent  on  the  type  of  model  which  is  used.  This  is  discussed  more  fully  in 
the  individual  model  descriptions  below. 

Based  on  the  control  theoretic  perspectives  established  above,  numerous  methods  for 
rf'presenting  and  evaluating  human  performance  m  a  wide  range  of  tasks  have  been  proposed. 
The  model  structures  vary  but  can.  in  general,  be  classified  into  three  groups.  The  first  group 
comprises  those  models  which  rely  on  linear  system  theory  in  the  frequency  domain  to  describe 
and  evaluate  the  human  s  control  behavior.  [Tustin,  (1947),  McRuer  et  al.  (1967),  Anderson 
(1970)  .  The  description  is  based  on  stability  of  the  entire  man-system  control  loop  Of  all  the 
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Figure  2.5:  Composite  Human/Vehicle/Environment  System 
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models  in  this  class,  the  quasi-linear  describing  function  model  and,  in  particular,  the  "cross¬ 
over"  model  of  McRuer  and  Jex  !l967:  have  emerged  as  the  most  dominant.  These  are  described 
in  more  detail  iii  the  next  subsection.  As  is  often  the  case  in  applications  fields,  the  second  class 
of  models  emerged  to  reflect  the  1960’s  trend  in  systems  analysis  from  the  use  of  frequency 
domain  toward  the  use  of  time  domain  techniques.  These  models  rely  heavily  on  state  space 
methods  to  represent  human  limitations,  perceptual  processes,  and  information  processing  and 
control  decision  capabilities.  The  most  sophisticated  and  well  validated  model  in  this  class  is  the 
Optimal  Control  Model  (OCM)  of  Kleinman  et  al.  [(1969),  (1971)).  Since  its  formulation,  this 
model  has,  in  varying  forms,  enjoyed  considerable  attention  and  multiple  applications.  It  is 
described  in  detail  below.  The  last  class  of  models  includes  an  enormous  number  of  nonlinear, 
finite  state  and  discrete  models.  The  motivation,  success,  and  applications  of  a  few  of  these  types 
of  models  are  discussed  briefly  at  the  end  of  this  subsection. 
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Quasilinear  Describing  Function  Method  (DFM) 

Quasilinear  Describing  Function  models  are  the  most  widely  used  and  well-  validated 
human  operator  models  in  the  class  of  models  which  apply  frequency  domain  methods  to 
represent  and  evaluate  the  system.  They  have  been  highly  successful  in  modeling  human 
behavior  in  the  limited  but  important  class  of  stationary  compensatory  tracking  tasks.  In  these 
tasks,  the  operator  observes  the  error  between  desired  and  actual  output  and  by  manual  means 
acts  to  null  or  "compensate”  for  the  error.  The  majority  of  applications  have  involved  automo¬ 
bile  steering  and  aircraft  piloting  [see  e.g.  Ashkenas  and  McRuer  (1962),  McRuer  and  Graham 
(1963),  McRuer  and  Wier  (1969)].  Models  based  on  the  describing  function  method  are  unstruc¬ 
tured;  they  attempt  to  describe  human  input/output  response  by  the  adoption  of  a  model  form 
and  the  selection  of  model  parameters  which  give  the  best  fit  to  data  available  for  a  given  task. 
During  the  development  stage  there  was  no  attempt  to  mimic  the  human's  physiological  struc¬ 
ture,  although  certain  analogs  have  been  identified  since.  These  are  discussed  later  in  this  subsec¬ 
tion. 

The  structure  of  the  compensatory  tracking  systems  typically  modeled  with  describing  func¬ 
tion  models  is  illustrated  in  the  block  diagram  of  Figure  2.6.  The  characteristics  of  the  controlled 
vehicle  and  control  actuator  are  lumped  into  the  block  labeled  "controlled  element  dynamics". 
The  human  operator  block  may  include  nonlinearities.  To  the  extent,  however,  that  the  man- 
machine  system  operates  under  stationary  conditions,  and  that  a  linear  model  can  account  for  a 
significant  p)ortion  of  the  human's  control  action,  a  quasi-linear  approach  to  modeling  the 
operator's  response  is  appropriate. 

For  the  quasi-linear  approach,  the  human's  control  response,  c, ,  is  represented  as  the  sum  of 
two  components; 

c,  =  I,  +ri,  (2.1) 

where  I,  is  the  response  of  an  "equivalent"  linear  element  and  ry,  is  the  "remnant".  In  the 
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Figure  2.6:  Single  Loop  Compensaton-  Tracking  System 

frequency  domain 

C(;u;)  =  G(]u;)E{j^)  -  N(ju;)  (2.2) 

where  G{j(jj)  is  the  "describing  function"  and  E{j^)  the  transform  of  the  input  to  the  human. 
Figure  2.7  illustrates  the  equivalent  block  diagram.  Signals  are  represented  in  the  frequency 
domain  to  emphasize  the  stationarity  requirement.  The  display  dynamics  are  lumped  in  the  con¬ 


trolled  element  block. 
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Figure  2.T.  Equivalent  Block  Diagram  of  the  Human  Operator  in  a  Continuous  Control  Task 


The  determination  of  a  describing  function  for  a  specific  non-linear  element  depends  on  the 
nature  of  the  input.  Commonly  used  input  types  are  periodic  (in  particular  sinusoidal)  or  ran¬ 
dom  with  specified  stochastic  properties.  The  case  of  sinusoidal  inputs  provides  a  simple,  illustra¬ 
tive  example  of  the  technique. 

Denote  the  response  of  a  nonlinear  element  by  c,  and  the  input  by  e, .  For  a  memory  less 
system  their  relationship  is 

c,  =  /(c.)  (2.3) 

If  e,  is  sinusoidal,  i.e.  e^  ^  E  sinwt,  the  response  is  likely  to  be  non-sinusoidal  but  periodic  with 
the  same  period  as  the  input.  It  thus  has  a  Fourier  series  expansion  which  is  the  sum  of  a  funda¬ 
mental  and  all  higher  order  harmonics.  The  fundamental  is  related  to  the  input  by  an  amplitude 
ratio  and  a  phase  shift.  This  relationship  defines  the  describing  function  of  the  element  in  the 
same  manner  that  it  defines  the  transfer  function  for  a  linear  system.  If  the  nonlinear  element  is 
memoryless,  the  output  is  in  phase  with  the  input  and  the  describing  function  is  simply  the 
Fourier  coefficient  of  the  fundamenttd;  i.e.  the  ratio  of  the  amplitude  of  the  fundamental  to  that 


of  the  input. 
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The  describing  function  derived  for  sinusoidal  inputs  has  another  interpretation  which  plays 
an  important  role  in  the  derivation  of  the  D.F.  for  random  inputs.  It  is  a  well-known  result  of 
the  theory  of  Fourier  series  for  pjeriodic  functions  that  the  Fourier  coefficient  of  each  term 
minimizes  the  mean-squared  error  between  the  associated  b<isis  element  and  the  original  function. 
For  example,  if  f(l)  has  the  Fourier  sine  series 

*  fit)  =  E  sin/fcuT  (2.4) 

then 


Cjf  =  C:  min  11/  -  C  E  s'lnku^T  is  achieved;  k  =  1,2,  ■  •  •  (2.5) 

where  the  norm  |i  •  Hj  is  the  L^[T)  norm 

T 

\\9  h  9(t)^dty/^  (2.6) 

Thus,  in  the  memoryless  case,  the  describing  function  is  the  equivalent  gain,  and 

r 

=  K:  min  (c,  -Ke,))^dt  is  achieved,  (2.“) 

where  t,  =  E  sin  ujt 

In  practical  manual  control  systems,  the  types  of  inputs  most  commonly  encountered  by  the 
human  are  random  or  random  appearing.  These  inputs  have  no  interpretation  in  terms  of  a 
Fourier  fundamental  and  higher  harmonics,  but  are  instead  specified  in  terms  of  their  statistical 
properties.  An  extension  of  the  interpretation  of  the  describing  function  as  a  linear  approxima¬ 
tion  which  minimizes  mean  squared  error  is  now  presented  for  random  inputs. 

The  problem  Booton  (1954)  is  to  find  a  linear  element  with  impulse  response  g  (r)  surh 

that 


min  !i  c. 


30 


(2  8) 


is  achieved.  The  function  c^  again  represents  actual  operator  response,  and  c,  his  input  The 
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iiorrn  for  a  stationary  second  order  random  process  is  defined  in  terms  of  the  probability  density 
fuiiriion  of  the  underlying  random  variables  as 


9(1)112  =  jf:  I  9(^)i 


:‘/2 


Ulp> 

J  9{^)^p{x)dx 


1/2 


(2.9) 


riie  calculus  of  variations  yields  the  defining  equation  for  the  describing  function,  G{juj),  as  the 
solution  of  (2.8); 


00 

I 


r  >  0 


or 


=  G(>a;)$„(w) 


(2.10) 


w  here  /?„ ,  and  are  the  cross-correlation  function  and  the  power  spectral  density  of  the  opera¬ 
tor  inputs  and  outputs.  This  usual  result  for  a  linear  system  provides,  in  fact,  the  defining  rela¬ 
tion  for  the  equivalent  linear  element  in  a  non-linear  system.  This  is  not  unexpected  since  corre¬ 
lations  measure  the  linear  relationship  of  signals.  Notice  that  (2.10)  does  not  provide  a  computa¬ 
tional  procedure  for  obtaining  the  describing  function,  G[ju)).  In  particular,  ^„(u;)  probably 
has  no  analytical  representation  but  must  be  empirically  derived  based  on  experimentation. 

Finally,  observe  that  equation  (2.10)  was  derived  via  an  open  loop  analysis.  In  a  feedback 
system  like  the  one  of  interest  here,  the  analysis  is  more  complicated  because  the  input  to  the 
nonlinearity  depends  on  the  response.  In  this  case  it  is  usually  assumed  that  the  input  to  the 
nonlinearity  is  Gaussian.  This  is  reasonable  because  if  the  output  of  the  nonlinearity  is  non- 
Caussian.  the  lowpass  characteristics  of  the  controlled  element  tend  to  make  it  more  Gaussian. 
Similarly,  in  the  case  of  a  feedback  loop  containing  sinusoidal  signals,  the  lowpass  controlled  ele¬ 
ment  tends  to  filter  the  higher  harmonics  of  the  output  of  the  nonlinearity  and  thus  restores  the 
sinusoidal  nature  of  the  input  signel. 


With  these  observations  and  the  equation  (2.10),  the  closed  loop  analysis  outlined  below 
McRuer  (19r>9)]  yields  a  representational  equation  for  G{ju))  in  terms  of  measurable  quantities. 


The  cross  spectral  density  between  the  operator  output  and  the  system  input  is 

1 


4*. 


— ^ 

l  +  GH  **  \  ^  GH 


1  +  GH 


*  *1 


(2.11) 


where  the  second  equality  follows  because  the  remnant  is  uncorrelated  with  the  input.  The  cross 
sp)ectral  density  between  the  error  and  the  input  is 


me 


1 

1  ^  GH 


4>. 


(2.12) 


Thus,  dividing  (2.11)  by  (2.12), 


G{juj) 


(2.13) 


Again.  (2.13)  permits  the  empirical  determination  of  G(ju>)  from  experimental  data  for  (;ti;) 

and 

Based  on  a  series  of  empirical  studies  involving  aircraft  pilots,  McRuer  1959|  concluded 
that  most  operator  behavior  could  be  well-fitted  by  the  generic  describing  function 


G{ju>)  =  K 


1  + 
1  + 


1  e 

1  + 


(2.14) 


with  an  additive  remnant,  where  the  time  delay  gain  K,  and  time  constants  are 

estimated.  In  general  it  is  thought  that  r^  and  ty  (which  has  come  to  be  known  as  the  "neu¬ 
romuscular  lag"),  are  essentially  inherent  physiological  quantities,  whereas  the  operator's  static 
gain,  K  and  lead  and  lag  time  constants,  r^,  reflect  the  equalization  adopted  by  the  human 
to  achieve  good  closed  loop  performance. 


The  generic  describing  function  (2.14)  has  been  successful  in  describing  human  transfer 
characteristics  for  a  variety  of  controlled  element  dynamics,  but  is  by  no  means  fixed.  McRuer, 
Graham,  and  Krendel  [I967j  note,  for  example,  that  for  low  frequencies  r^y  acts  essentially  to 
increase  the  lime  delay  and  thus  combined  and  r^y  to  yield  an  effective  time  delay  re  and  the 
simpler  form  of  the  G(]uj)-, 

(2.15) 

This  form  has  been  used  in  concert  with  multiple  conventional  stable  controlled  elements  with 
low  input  frequencies  [see,  in  addition,  Levison  and  Elkind  (1967)  .  On  the  other  hand,  in  some 
cases,  such  as  in  the  presence  of  an  unstable  controlled  element  or  higher  input  frequencies,  more 
complicated  structures  have  been  required  to  yield  adequate  matches  with  experimental  data. 
One  of  the  more  common  refinements  has  been  the  inplusion  of  a  catchall  increment  in  the  low 
frequency  phase  angle  to  account  for  the  low  frequency  lags  observed  in  operator  data.  McRuer 
et  al.  (1967),  (1969)]  discuss  these  more  fufly  and  present  gain-phase  plots  for  numerous  con¬ 
trolled  elements,  all  of  which  are  matched  by  some  version  of  the  describing  function. 

Complete  specification  of  a  describing  function  is  a  two  step  procedure,  the  first  of  which  is 
the  sp)ecificalion  of  a  form  such  as  (2.14)  or  (2.15).  Next,  a  strategy  for  the  selection  of  the 
parameters  K,  r^,  Tj  etc.  such  that  the  closed  loop  system  exhibits  "good"  closed  loop  perfor¬ 
mance  is  required.  A  good  system  in  the  classical  sense  should  jsee  for  example  Dorf,  (1974);; 

•  suppress  disturbances; 

•  reduce  the  sensitivity  of  the  system  to  variations  and  uncertainly  in  the  elements  of 
the  system; 

•  provide  good  servo  response  over  the  bandwidth  of  the  inputs; 


G{]uj)  =  K 


1  }-^^L 

1  -f  JUT) 


provide  adequate  gain  and  phase  margins. 
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These  properties  are  classically  analyzed  in  terms  of  the  open-loop  frequency  response. 

where  is  the  transfer  function  of  the  controlled  element,  and  G(ju.)  is  the  describing  func¬ 

tion.  In  principle,  the  goals  set  out  above,  or  at  least  a  compromise  thereof,  can  be  achieved  by 
choosing  the  parameters  so  that  the  open  loop  gain  is  high  for  low  (input)  frequencies,  and  low 
for  high  (noise)  frequencies.  .Adequate  stability  margins  must  be  simultaneously  maintained.  In 
the  "crossover"  model  of  .McRuer  et  al.  '1967  these  requirements  are  met  by  the  selection  of  Tj, 

Ti ,  and  A’  so  that  the  open  loop  gain  behaves  as  an  integrator  in  the  region  where 

^01  ^  *  This  0  dB  frequency  is  called  the  crossover  frequency,  Mathematically, 

Hql  ~  =  — —  e  ’  '  near  ^  .  (2  16) 

J  W’ 

This  equation  specifies  what  Hqi  should  look  like,  but  does  not  indicate  an  automatic  procedure 
for  the  adjustment  of  r^.  ,  ry,  K  and  r,  to  achieve  the  crossover  behavior.  McRuer  and  Jex 

1967  summarize  a  series  of  what  they  call  "verbal  adjustment  rules"  for  adjusting  the  parame¬ 
ters  to  achieve  (2.16).  They  are  not  neat,  sequential  rules  but  rather  guidelines  which  have  been 

applied  and  have  led  to  successful  esuits  for  controlled  elements  with  A  ,  K  s,  A  .s‘,  and 

A  s  (  a  -  r)  dynamics 

Recall,  the  second  component  of  operator  response  which  results  from  the  quasi-linearization 
decomposition  process  is  the  remnant.  The  remnant  is  that  p>ortion  of  the  opierator's  total 
response  which  is  linearly  unrelated  to  the  input.  It  s  existence  is  attributed  to  such  factors  as: 

•  non-linear  human  input  output  response  due,  for  example,  to  indifference  thresholds; 

•  non-steadv  pilot  behavior 

•  intentional  noise  miectioti  by  the  human  to  probe  or  linearize  the  system. 

•  stfvchaistir  variation  intrinsic  in  human  resptonse 
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Methods  for  modeling  remnant  are  not  as  well-developed  as  for  the  linear  portion  of  the 
DFM.  The  existing  models  generally  consist  of  empirically  derived  (1st  order)  noise  spectra 
injected  at  the  operator's  input  or  output  (see  Figure  2.8).  The  point  of  insertion  is  arbitrary  and 
is  not  intended  to  indicate  the  physical  source.  The  equivalent  closed  loop  remnant,  however,  is 
uncorrelated  with  the  input  (and  thus  any  linear  transformation  thereof).  Thus,  the  power  spec¬ 
trum  of  the  output,  4>cc(-^’)'  given  as  the  sum 


12 


1  -H  G (juj) 


(2.17) 


"here  - is  the  equivalent  closed  loop  describing  function.  In  the  optimal  control 

described  in  the  next  section,  the  model  for  that  portion  of  remnant  due  to  the  stochastic 
of  the  human’s  response  is  made  more  explicit. 


model 

nature 
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Human  Operator 

I - T 


a)  Remnant  injected  at  operator’s  input 


Human  Operator 

t - 1 


b)  Remnant  injected  at  operator’s  output 


I  1+GH|‘  '  i  1  ^GH|  ^ 


Figure  2  8:  Remnant  Models 
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Discussion 

Describing  function  models  have  been  used  to  explain  a  large  body  of  manual  control  data, 
1  hiis  justifying  the  approximation  of  man  as  a  linear  element  in  some  situations.  The  best  results 
ha\e  been  obtained  for  systems  in\olving  a  compensatory  tracking  task  with  a  single  display  and 
single  manipulator.  As  previously  mentioned,  excellent  agreement  has  been  obtained  between 
measured  operator/system  describing  functions  and  those  obtained  via  the  crossover  model. 
Bode  plot  comparisons  for  a  variety  of  controlled  element  dynamical  structures  appear 
throughout  the  literature  [see  in  particular  McRuer  et  al.  (1959),  (1969)]. 

Outside  the  realm  of  strict  laboratory  conditions,  describing  function  models  have  also  been 
used  to  predict  human-vehicle  performance,  to  study  stability  or  other  problems  associated  with  a 
particular  manned  system,  and  to  generate  insight  into  the  mechanisms  of  human  perception  and 
control.  As  an  example,  McRuer  and  Weir  il%7|  report  the  use  of  the  crossover  model  to  inves¬ 
tigate  the  importance  of  several  visual  cues  in  a  freeway  driving  task.  They  modelled  driver 
steering  during  overtaking  and  passing  maneuvers  under  good  and  degraded  vehicle  and  environ¬ 
mental  conditions.  The  baseline  case  involved  nominal  vehicle  conditions.  The  second  involved  a 
reduction  of  air  pressure  in  the  rear  tires  which  resulted  in  notably  different  vehicle  dynamics. 
Both  systems  were  second  order,  but  the  second  was  unstable  in  the  open  loop  and  thus  required 
constant  attention  by  the  driver  to  produce  a  conditionally  stable  system.  Four  crossover  models 
were  developed  for  each  system;  one  each  to  model  human  response  to  heading  angle,  heading 
rate,  path  angle,  a. id  path  rate.  The  resulting  driver-vehicle  Bode  plots  revealed  that  heading 
rate  as  an  input  yielded  the  best  system,  especially  under  degraded  conditions.  Experimental 
data  collected  for  the  degraded  system  also  led  the  investigators  to  conclude  that  "the  driver’s 
dominant  response  (under  these  conditions)  is  to  heading  rate".  This  corresponded  to  their 
interpretation  of  the  modeling  results. 

Another  example  of  the  type  of  analysis  afforded  by  models  of  this  type  is  reported  in 
McRuer  and  Weir  [1967].  In  this  case  the  crossover  model  was  used  in  concert  with  a  fourth 
order  air  frame  model  to  predict  pilot  dynamics,  to  derive  a  correlation  between  these  dynamics 
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and  the  pilot's  rating  of  the  aircraft,  and  to  demonstrate  how  stability  problems  can  be  predicted 

and  better  understood  so  that  corrective  measures  can  be  taken.  Details  may  be  found  in  the  * 

reference. 

Extensions  of  describing  function  methods  to  multi-input/multi-output  tasks  have  been 
developed  McRuer  and  Jex  (1969)j,  but  in  general,  application  of  the  approach  to  such  systems  is 
difficult.  This  is  because  there  are  no  systematic  rules  for  selecting  loop  structures  (i.e.  inner 
loops  and  outer  loops),  describing  function  forms  within  a  loop,  or  model  parameters.  Thus,  any 
attempt  at  application  would  involve  a  comprehensive  research  program  supported  by  extensive 
experimentation. 

Even  when  the  task  of  interest  is  of  the  single  input/single  output  type,  there  are  some 
problems  associated  with  the  application  of  the  DFM  to  problems  outside  the  domain  within 
which  it  was  developed.  The  basic  difficulty  lies  in  the  absence  of  systematic  principles  for  choos¬ 
ing  model  structures.  The  model  structures  given  by  (2.14)  and  (2.15)  worked  well  for  the  t^lsks 
referenced,  but  since  they  have  no  real  physical  or  rational  basis,  they  provide  little  guidance  to  a  * 

modeler  addressing  a  new  task.  Additionally,  the  verbal  adjustment  rules  used  by  McRuer  el  al.  • 

1967  lo  find  the  parameters  of  the  model  which  achieved  crossover  requirements  are  not  easily 
adapted  to  new  situations  Thus,  the  modeler  is  faced  with  a  non-trivial  parameter  selection  task 
even  if  he  has  a  model  structure  which  he  has  reason  to  believe  will  work.  A  new  application  of  a 
model  of  this  type,  would  therefore  probably  require  a  large  scale  experimental  program  before  it 
could  be  used  to  describe  even  the  gross  behaviour  of  the  system  with  any  certainty. 

In  an  attempt  to  overcome  the  problems  identified  above,  Anderson  1970  proposed  another 
frequency  domain  approach  for  application  to  V/STOL  aircraft  analysis.  The  model,  called 
"paper  pilot"  is  a  fixed  form  model.  It  is  based  on  the  hypothesis  that  the  pilot  adopts  an  equal¬ 
ization  strategy  which  maximizes  his  impression  of  the  vehicle  handling  qualities.  Mathemati¬ 
cally.  a  model  structure  is  postulated  (that  chosen  by  Anderson  for  the  V  STOL  hovering  task  is 
shown  in  Figure  2.9).  then  the  parameters  are  selected  to  minimize  some  rating  function  which 
weights  pilot  workload  as  well  as  system  performance.  The  problem  of  choosing  model 
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Commanded 
Position 
x,  =  0 


Longitudinal 

Turbulence  Longitudinal 


Pilot  Model 


I. _ j 

Kp_  =  Pilot  gain  for  outer  control  loop 

Tl_  =  Pilot  lead  time  constant  for  outer  control  looop 

6^  =  Pilot  internally  generated  pitch  command  for  inner  control  loop 

Kp^  =  Pilot  gain  for  inner  control  loop 

Tl,  =  Pilot  lead  time  constant  for  inner  control  loop 

T  =  Pilot  effective  time  delay 

Figure  2.9;  Block  Diagram  of  Helicopter  Longitudinal  Hover  Taisk 
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parameters  is  thus  replaced  with  that  of  selecting  a  cost  functional  and  then  performing  a  param¬ 
eter  optimization  (which  may  itself  be  computationally  burdensome).  Anderson’s  approach  was 
applied  to  some  pre-existing  data  for  which  pilot  parameters  were  available  and  yielded  good 
matches  to  the  pilot  ratings  of  their  aircraft.  On  the  other  hand,  when  the  method  was  applied 
to  pred.  t  pilot  parameters  and  pilot  rating,  the  predictions  of  rating  were  good,  but  the  matches 
with  measured  pilot  parameters  questionable.  The  utility  of  this  approach  on  a  different  problem 
thus  remains  unverified. 

Another  limitation  of  the  frequency  domain  models  described  above  is  their  inability  to 
account  for  operator  response  in  the  absence  of  stimulation.  As  an  example,  if  a  target  disap¬ 
pears  temporarily,  the  human  continues  to  track,  based  perhaps  on  velocity  estimates,  until  it 
reap^.cars.  These  methods  have  no  mechanism  for  handling  such  a  situation. 

Finally,  all  frequency  domain  methods  have  the  ultimate  limitation  that  they  are  strictly 
valid  only  under  stationary  operating  conditions.  An  interesting  aspect  of  TOV  analysis,  how¬ 
ever,  involves  the  operator’s  ability  to  respK>nd  to  time  varying  (in  particular  suddenly  changing) 
features  of  the  environment  which  have  a  direct  influence  on  the  dynamical  structure  of  the  total 
systerr;.  Related  situations  have  been  addressed  by  several  researchers  in  the  peist  Phatak  and 
Bekey  (1969),  Elkind  and  Miller  (1%7)].  Their  models  were  motivated  by  a  desire  to  model 
human  adaptive  capabilities  in  the  event  of  a  system  failure  which  results  in  a  sudden  change  in 
the  system  dynamics.  The  model  of  Elkind  and  Miller,  for  example,  addresses  a  situation  in 
which  the  human  operator  controls  a  system  which  is  subject  to  sudden,  random  changes  in  pro- 
'-pss  dynamics.  They  assumed  that  the  controlled  system  consists  of  a  finite  number  of  linear, 
time  invariant  subsystems,  and  that  the  operator  has  an  internal  model  for  each.  At  a  random 
time,  the  system  switches  from  one  mode  to  another  They  assume  that  the  operator  is  aware  of 
possible  transitions  and  is  well  trained  in  dealing  with  them.  The  authors’  proposed  model  relies 
on  the  crossover  model  to  describe  the  human’s  behavior  between  changes,  and  methods  of  sta¬ 
tistical  decision  theory  to  model  his  adaptive  behavior  when  a  change  occurs.  The  model  for  his 
adaptation  consists  of  three  stages:  detection,  identification  and  modification.  In  the  first  stage 
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he  detects  changes  based  on  a  comparison  of  expected  and  actual  error  rate.  In  the  second,  he 
identifies  the  type  of  change  which  has  occurred.  Finally,  he  selects  his  new  control  behavior  and 
rapidly  adjusts  to  the  new  steady-state  crossover  model.  The  model  emphasizes  the  detection 
and  identification  phases;  the  experimental  program  was  correspondingly  designed  to  verify  pred¬ 
ictions  of  the  subjects’  times  of  detection  and  identification  of  changes.  The  control  dynamics 
which  result  after  identification  has  occurred  are  not  addressed  by  this  model,  but  since  transi¬ 
tions  are  assumed  to  occur  infrequently,  the  dynamics  during  the  transition  period  have  little 
effect  on  overall  system  performance.  In  a  TOV  encounter,  system  changes  occur  frequently  and 
the  operator's  response  during  transitions  is  more  critical.  Weir  and  Phatak  |1966]  addressed  the 
transition  problem  with  the  addition  of  another  stage  between  the  "identification"  and  "post- 
transition  steady  state"  control  stages.  During  this  intermediate  period,  they  modeled  the  opera¬ 
tor  as  a  time  optimal  controller  who  acts  to  null  the  error  which  has  accumulated  during  the 
detection  identification  stage  before  assuming  a  new  steady-state  control  strategy.  The  model 
explains  the  bang-bang  control  behavior  exhibited  by  controllers  in  situations  where  large  errors 
have  accumulated. 

The  frequency  domain  methods  presented  in  this  subsection  are  not,  in  general,  easily 
adapted  to  time  variable  systems.  A  typical  TOV  encounter,  however,  is  characterized  by  very 
pronounced  time-variability;  that  is  by  frequent,  sudden  changes  in  the  dynamical  system  struc¬ 
ture.  The  environmental  features  which  indicate  these  changes  (to  the  operator)  may  be  ambigu¬ 
ous  and  hidden  in  noise.  Due  to  their  moderately  high  frequency,  the  operator  s  control  behavior 
during  detection  and  transition  stages  is  reflected  in  overall  system  performance.  For  a  system 
such  as  this,  a  time  domain  approach  derived  from  that  embodied  by  the  Optimal  Control  Model 
(OCM).  may  provide  a  more  natural  avenue  for  modeling  the  system. 
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The  Optimal  Control  Model 

The  representative  model  from  the  second  cl<iss  of  control  theoretic  human  operator  models 
is  the  Optimal  Control  Model  (OCM)  of  Baron,  Kleinman  and  Levison  '(1969),  (1971)'.  Since  its 
development,  the  model  has  been  applied  to  a  wide  variety  of  manual  control  problems  with  con¬ 
siderable  success.  An  extensive  backlog  of  empirical  data  validates  the  Mlity  of  the  model  to 
mimic  human  performance  of  manual  control  tasks.  These  include,  for  example,  car  following 
IBekey  (1977)],  Remotely  Piloted  Vehicle  (RPV)  operation  [Grunwald  and  Merhav  (1976), 
(1978)],  AAA  tracking  iPhatak  (1977)],  tank  tracking  [Kleinman  (1981)],  and  V/STOL  hovering 
Baron  and  Kleinman  (1971),  Kleinman  and  Killingsworth  (1981)].  More  recent  research  is 
devoted  to  the  extension  of  the  OCM  methodology  to  supervisory  and  multiple  task  control  prob- 
lem.3  which  involve  increased  decision  making  and  reduced  manual  participation  on  the  part  of 
the  human.  For  example,  models  have  been  developed  for  the  DEMON-multiple  RPV  operation 
task  (Mulradin  and  Baron  (1980)]  and  the  AAA  flight  crew  (AAACRU)  Zacharias  et  al.  (1982)  . 
These  models  remain  to  be  empirically  validated. 

The  success  of  the  OCM  is  attributed  primarily  to  the  flexibility  of  the  modeling  technique 
in  treating  multi-dimensional,  time  variable,  nonlinear  and  nonsteady  state  stochastic  control 
problems  within  the  well-developed  theory  of  state  variable  optimal  control.  Multiple  tasking, 
monitoring  and  attention  sharing  are  easily  incorporated  into  the  model  structure  Pattipati  et  al. 
(1983),  Kleinman  and  Curry  (1977),  Levison  et  ai  (1971)]  as  long  as  the  system  can  be  described 
by  linear  stochastic  differential  equations.  A  description  of  the  model,  a  review  of  some  applica¬ 
tions,  and  a  discussion  of  the  model  follow. 

The  Optimal  Control  Model  is  based  on  the  assumption  that  the  well-trained,  well 
motivai.ed  human  operator  behaves  in  a  near  optimal  fashion  subject  to  his  internal  limitations 
and  understanding  of  the  task.  This  underlying  assumption  is  not  unique  to  this  model.  In  fact, 
the  "crossover  model"  described  in  the  last  section  is  based  on  a  similar  point  of  view.  What 
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difr('rentiates  this  model  is  the  method  for  representing  human  limitations  and  the  structuring  of 
tin  model  lo  include  components  which  compensate  for  them. 

A  block  diagram  of  the  standard  optimal  control  model  is  given  in  Figure  2.10.  At  the  top 
i>f  the  loop  is  the  model  for  the  physical  system  (actuators/  machine/environment).  It  is 
assumed  to  be  described  by  stochastic  differential  equations,  in  particular,  differential  equations 
driven  by  continuous  random  inputs.  The  state  variables  in  such  a  model  are  hereafter  termed 
"base  stales".  These  could  include,  for  example,  position,  velocity,  pilch  or  angular  rate. 

Al  the  bottom  of  the  diagram  is  the  block  containing  the  model  for  the  human  operator. 
The  inputs  are  the  sensory  information  available  to  the  human.  These  could  include  visual,  audi¬ 
tory  and  or  vestibular  data  as  a  function  of  the  application.  These  "displayed"  variables  are 
assumed  to  be  linear  combinations  of  the  b<ise  state  variables. 

In  contrast  to  the  unstructured  describing  function  model  of  the  last  section,  the  OCM  is  a 
structured  model.  It  reflects  a  homomorphic  map  of  the  three  psycho-motor  functions  of  percep¬ 
tion,  information  processing  and  control  actuation.  The  block  representing  perception  transforms 
displayed  sensory  data  into  that  which  is  perceived  by  the  human.  The  information  processing 
block  mathematically  describes  the  human’s  ability  to  deduce  information  about  the  current  state 
given  the  perceived  variables.  Finally,  the  actuation  block  models  the  human’s  generation  of 
commanded  and  realized  controls  as  a  function  of  the  estimated  base  states.  It  should  be 
emphasized  that  although  these  blocks  are  organized  to  structurally  match  human  functions, 
there  is  no  attempt  to  define  the  actual  mechanism  by  which  these  functions  are  accomplished. 
Indeed,  the  mathematical  techniques  employed  to  model  the  performance  of  these  functions  sim¬ 
ply  provide  a  model  structure  and  a  means  for  parametrically  describing  his  total  response.  This 
has  proved  to  be  adequate  in  many  applications.  The  system  model  and  mathematical  forms 
contained  in  each  block  are  now  presented. 


C^ontrollerl  System 


Figurt*  2.10;  Optimal  Control  Model 
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System  Model 

The  system  mode)  is  given  by  a  set  of  linear  stochastic  differential  equations  which  comprise 
the  linearized  dynamics  of  the  controlled  system  and  environment.  Dynamics  associated  with  the 
meaNuremeni  and  control  subsystems  are  also  assumed  to  be  included  here.  The  equation  for  the 
dxnamic  model  is 

dxi  =  AXfdt  -t-  Bufdt  dWf  (2.18) 

where  x,  is  an  n-vector  of  dynamic  ba.se  states,  u,  is  an  r-vector  of  control  variables  and  te,  is  an 
n -vector  of  Brownian  motion  with  intensity  W'; 

{dw,)  (du-i)  =  Wdt  .  (2.19) 

A  and  B  are  (m  x  m)  and  (m  x  r)  linear  transformation  matrices,  respectively.  They  are 
assumed  time  invariant  for  convenience,  but  the  method  applies  to  the  time  variable  case 
directlv . 

The  displayed  variables  are  assumed  to  be  linear  combinations  of  the  base  state  variables; 

y*  =  Dx,  (2.20) 

where  D  is  a  linear  transformai 'on  matrix  and  yf  a  p-vector.  The  vector  yf  snould  contain  all 
sensory  information  available  to  the  human  which  might  aid  his  understanding  of  the  system 
state.  This  could  include  visual,  auditory  and  vestibular  information  as  a  function  of  the  system 
configuration.  For  example,  in  an  in  situ  car  driving  task,  the  driver  observes  his  relative  posi¬ 
tion  (with  respect  to  a  car  ahead  or  the  center  of  the  road),  relative  velocity  and  perhaps  other 
"visual"  information.  These  are  easily  expressed  as  linear  combinations  of  position  and  velocity 
states.  Additionally,  he  might  physically  sense  the  acceleration  of  the  car  as  he  traverses  a  shar¬ 
ply  winding  road,  i.c.  "vestibular"  information.  This  information  is  again  expressible  in  terms  of 
linear  combinations  of  acceleration  states.  It  is  clear  than  an  expedient  form  of  the  state  vector 
relative  to  the  sensed  information  should  be  selected. 
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The  structure  of  y*,  and  the  associated  problem  of  modeling  it,  vary  in  complexity.  In  the 
case  that  the  display  is  an  instrument  panel  or  a  simple  display  composed  of  well  defined  data 
(such  as  one  might  encounter  in  a  tracking  or  laboratory  control  task),  the  structure  of  y/  is  more 
or  less  self-evident  When  the  display  is  a  cluttered  visual  scene,  however,  the  composition  of  the 
vector  y*  is  not  as  obvious.  One  method  for  constructing  it  involves  the  augmentation  of  the 
standard  display  vector  (i.e.  a  vector  composed  of  such  things  as  centerline  displacement  and 
relative  velocity  in  a  tracking  or  driving  task)  with  an  observation  of  a  "feature"  vector  which  is 
compiosed  of  additional  clues  about  the  system  and  environment.  In  some  applications,  the  infor¬ 
mational  content  of  the  additional  clues  can  be  significant  and  their  inclusion  in  the  model  for  the 
display  thus  mandatory.  In  fact,  it  is  the  ability  of  a  human  driver  to  quickly  extract  (with  var>- 
ing  degrees  of  certainty)  these  features  which  distinguish  him  from  the  current  autonomous  pilots 
and  enables  him  to  perform  more  capably  in  a  cluttered  environment.  This  method  is  discussed 
more  completely  as  part  of  the  generalized  operator  model  in  Section  3 


-  44  - 


Human  Operator  Model 

The  mathematical  models  used  in  the  OCM  to  model  the  three  functions  of  perception, 
information  processing  and  control  generation  are  now  presented. 

Perceptual  Processor: 

A  human  operator  has  certain  limitations  which  interfere  with  the  process  of  perception, 
thus  preventing  him  from  making  perfect,  instantaneous  observations  and  nterpretations  of  the 
system  display.  The  OCM  accounts  for  these  limitations  by  the  inclusion  of  an  equivalent  per¬ 
ceptual  time  delay  and  lumped  observation  poise.  The  association  of  the  perceived  variables,  y* 
w  ith  those  displayed,  is  thus  given  by  the  equation 

y?  ^  y*.r  ^ '’i  T  (2.21) 

where  v,  is  a  vector  of  Gaussian  white  noise  which  is  independent  of  all  other  noise  processes  and 
has  intensity  R,,  and  r  is  the  equivalent  perceptual  delay.  The  noise  is  generally  attributed  to 
such  things  as  errors  in  perceptual  resolution  or  central  processing,  but  the  mechanism  for  its 
generation  is  not  important  to  the  model.  The  model  is  completely  specified  by  the  selection  of 
the  noise  intensity,  R,,  the  delay  r,  and  the  identification  of  any  dynamics  associated  with  the 
perceptual  process  or  display. 

R,  reflects  the  quality  of  the  information  in  the  display  vector  as  perceived  by  the  human. 
The  value  is,  of  course,  display  dependent.  There  are  no  set  rules  available  for  its  selection,  but 
certain  guidelines  have  been  suggested.  In  general,  R,  is  chosen  t  be  proportional  to  the  mean 
squared  value  of  y,^; 

(«.),  =  (y').'  (2.22) 

which  defines  as  the  noise/signal  ratio  of  the  displayed  variable  type.  A  value  of  P^ 
corresponding  to  -20  dB  Kleinman  et  al.  (1971  )J  has  been  appropriate  for  a  variety  of  single  axis 
tracking  tasks.  If  indifference  thresholds  are  important,  they  can  be  accommodated  at  this  point 
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by  the  use  of  a  more  complex  form  of  (2.22).  Another  guideline  for  the  selection  of  /?,  in  complex 
multivariable  tasks  is  given  by  the  attention  sharing  model  of  Levison  et  al.  ,1971..  The  model  is 
based  on  the  assumption  that  P,  is  essentially  constant  and  that  P, ,  the  noise/ signal  ratio  associ¬ 
ated  with  the  ith  display  variable  may  be  selected  as 


(2.23) 


where  is  the  fraction  of  attention  devoted  to  the  task,  and  /,  is  the  subfraction  of  the 
operator's  attention  given  to  (y/), 

Values  for  r  have  been  shown  to  be  essentially  constant  and  on  the  order  of  .2  i  .05  sec. 

Kleinman  et  al.  (1971):. 

Some  systems  exhibit  perceptual  bandwidth  limitations  as  a  result  of  the  dynamic  proper¬ 
ties  of  the  display  or  internal  processing  by  the  human.  In  these  cases,  the  associated  lags  can  be 
incorporated  directly  into  the  model  for  the  system  dynamics,  (2.18),  and  need  not  be  treated 
separate!)  here. 
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Jti formation  Processor/Control  Generator: 

The  assumptions  about  the  human  operator’s  task  comprehension  and  control  strategy 
wiiifh  admit  the  formulation  of  the  OCM  are  now  made  precise.  The  operator’s  control  objective 
IV  U)  apply  an  input  to  u,  to  the  dynamic  system  (2.18)  so  as  to  minimize  a  cost  functional  of  the 

furni 


J,  (u)  = 


+  u,  .\ti, )  ds  I  Yf 


(2.24) 


where  Y,  is  the  filtration  generated  by  the  observations  of  the  system,  V,  =  {  y/;  *  $  t  },  is 

as  in  equation  (2.21),  and  Af  and  .V  are  appropriately  selected,  positive  semidefinite  and  positive 
definite,  respectively,  weighting  matrices  which  should  reflect  relative  costs  associated  with  the 
\arious  states  and  controls.  In  applications,  the  parameters  of  these  matrices  are  the  primary 
means  by  which  the  modeled  operator  response  is  shaped  to  match  actual  data.  In  some  applica¬ 
tions.  a  Nveighting  on  control  rate,  u,  is  also  included  instead  of  or  in  addition  to  that  on  v, .  This 
issue  is  addressed  in  the  discussion. 

The  control  selected  to  minimize  (2.24)  must  be  chosen  from  the  class  of  corresponding 
admissible  controls,  i^.  This  class  of  functions  is  defined  by  the  following  prop>erties  'Tse  (1971  )j: 
First,  the  function  at  any  time  must  depend  only  on  past  observations.  Mathematically,  u((,y) 
must  be  Yj  measurable.  It  must  thus  have  the  form 


«(t,y)  =  7(<,y,:  8^0 


(2.25) 


Such  a  control  is  termed  non-anticipative.  Equation  (2.25)  is  stated  equivalently  in  terms  of  a 
stopped  function 


y.  t,<s<t 

y,  t  <  s  <  T 


Then, 


«(''.y)  =  7(t,y') 


(2.26) 
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where  is  a  mapping  from  R  x  T  —  /?',  and  T\  is  the  class  of  continuous 

functions  from  f, ,  T  ->  .  Second,  a  condition  which  assures  the  existence  of  a  unique  (in  pro¬ 

bability  law  )  solution  to  (2.18),  (2.21),  such  that 

E{]  X,  *  Y,}<  .Vf,  ^ 

and 


/  I  I  *  rft  ■  K. 


<00  k>0 


is  required.  A  uniform  Lipshitz  condition  on  7 

‘'l{t,g)-'y{3j)  II  $  a'!  /-ff  He;  f  ,geC^(t^,  T) 


(2.27) 


(2.28) 


where  ^  is  the  usual  sup  norm 


il  •  =  sup  II  •  II 

,  T\ 

and  is  the  Euclidean  norm,  provides  a  sufficient  condition. 

Thus,  the  problem  is  summarized  as  finding  a  control  of  the  form  (2  25),  satisfying  (2.28). 
which  minimizes  (2.24)  subject  to  the  dynamic  constraints  (2  18).  under  the  additional  assump¬ 
tion  that  the  human  has  an  internal  model  for  the  system.  The  separation  principle  of  stochastic 
control  provides  the  framework  for  its  solution. 


THEOREM  1  -  The  Separation  Theorem,  adapted  from  Fleming  and  Rischel.  Ch.  V  1975 
1.  Equation  for  the  conditional  mean:  Suppose  z,  and  y,  are  stochastic  processes  which  satisfy 

dx,  =  {Ax,  Bu,)  dt  *  dw,  (2  29) 

dy,  '  Dx,  dt  *  dv,  (2  30) 

where  y,  is  observed,  and  w,  and  v,  are  independent  Browniac  motions.  Let  V,  denote  the 
observation  rr-algebra.  F,  =  cr  {  y,  s  <  t  },  Then,  the  conditional  mean  f,  E(x,  Vj). 
obeys  the  linear  stochastic  differential  equation 
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dx,  =  (Ax,  ■+  Bu,)  dt  4  P,D  dif, 
where  i>,  is  the  "innovations  process"  defined  by 

di/,4  dy,  -  Dx,  dt 

and  P,  is  the  error  covariance,  £  {  (i,  -  it)  (^t  -  ^i)  and  is  non-random. 

2.  The  cost  functional  (2.24)  can  be  rewritten  in  terms  of  the  conditional  mean  as 


J,(u)  =  E 


f  (^,  «.  I  y, 


4Z, 


where 


1 

r 

T 

II 

1  J*  X,  'Mx,ds  \  Y, 

=  E 

J  tT(MP,)ds  1  Y, 

t 

\ 

(2.31) 


(2.32) 


(2.33) 


3.  Let  u  denote  the  class  of  admissible  controls.  The  original  stochastic  control  problem  with 
partial  observations, 


min  ^  I  ^  (^1  )  Xi 


s.t.  (2.29),  (2,30) 


thus  has  the  equivalent  formulation  as  a  stochastic  control  problem  with  complete  observa¬ 
tions 


/ 


min  E  <, 


J  (z,'.V/f,  +  u.'AuJd*  y, 


Z,  s.t.  (2.31) 


Furthermore,  since  Z,  is  independent  of  the  control,  the  problem  may  finally  be  expressed  as 
the  standard  stochastic  linear  regulator  problem 
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min  E 


u,  \u,  )  ds 


y,  \  s.l.  (2.31) 


/ 


(2.34) 


The  problem  has  thus  been  transformed  into  a  linear  stochastic  control  problem  with  acces¬ 
sible  Slate  and  quadratic  cost.  The  solution  to  thi.s  problem  (2.34)  is  well-known  e.g.  Fleming 
and  Rischel  Ch.  \  I.  1975  : 

u  (t,f()  =  -  ' K,Xf  =  L  X,  (2.35) 

where  f,  is  as  above,  and  K,  is  the  solution  of  the  matrix  Riccati  equation 

E,  A',  .4  4  A,  -  K,B\  -  M  (2.36) 

with  boundar>  condition  hj  0. 

The  proposition  is  staled  for  the  ca.se  of  nondelayed  observations.  Kleinman  1969  showed 
that  in  tne  time  delayed  ca.se,  the  separation  property  still  holds  wiif.  the  mmse  filtered  estimate, 
i,  =  £  {x,  y,  }  replaced  by  the  mmse  predicted  estimate,  i,'  -  £  {r,  V,  }. 

The  significance  of  the  proposition  is  as  follows;  The  original  problem  (2.1«)  (2.21)  (2.24) 
can  be  solved  as  two  separate  problems,  one  of  estimation  and  the  oUier  of  control  F  irst.  find 
'he  mmse  estimate  of  z,  given  the  observ  at  and  second,  solve  the  equr.  aU'rit  .stochasiii  con¬ 

trol  problem  with  accessible  stale  via  equations  (2  35),  (2.36).  In  accordance  with  the  block 
diagram  in  Fig.  2.10,  the  two  phases  of  solution  are  now  addressed  in  the  individual  blocks 
termed  "information  processor"  and  "control  generator,"  respeciivelv  The  equations  associated 
with  each  by  the  OCM  are  now  presented. 


Information  f’rnrr shot 

The  information  processing  block,  the  heart  of  tfie  fX  M  cousi  -i-  !,f  a  Kairi,an-F3uc  v  tiiiei 
cascaded  wiifi  an  optirnai  predictor.  Ffiese  generate  the  mmse  esTunates  of  itie  '-\stem  states 
given  the  delayed,  noisy  ofiservai  loiis  y,f  F.st  imate<-  of  the  mean  'squared  uiKcrta.'  in  the  state- 
estimates  are  also  generated,  thus  enabling  computation  ol  iFie  optimal  cus'  via  f.quation  (2,33). 
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The  use  of  the  Kalman-Bucy  filler  implicitly  assumes  that  the  operator  knows  the  linearized 
stale  \ariable  representation  of  the  system  dy  namics  and  external  stochastic  disturbances;  he  has 
an  interna!  model.  This  assumption  is  strong,  but  experience  has  shown  small  errors  in  the  model 
can  usual!}  be  compensated  by  the  in’rnduction  of  noise  into  the  mode)  at  the  process  and  motor 
le\els.  The  human's  model  for  the  system  is  thus  given  by 

dz,  -  (A  \  i  Bu,)  dt  »  dw,  (2.37) 


w  here 

E  {  dw,  dw',  }  =  Wdl . 

all  other  quantities  have  the  same  interpretations  in  Eqn.  (2.18).  and  A  and  B  are  "close"  to  A 
and  B 

The  human's  observation  of  the  system  is  given  by  the  output  of  the  perception  block; 

y!'  ^  Dx,_,  ^  v,^,  (2.38) 

where  equations  (2.20)  and  (2.21)  have  been  combined.  Equation  (2.38)  has  the  equivalent 
represent  a!  luri 

dy,  ^  *  dn,  ,  (2.39) 


where  y.  .s  the  solution  of  the  equation 


dy 


dt 


y, 


and  r/,  it  the  Brownian  motion  process  from  which  the  while  noise  r,  is  derived.  Formally. 


dr), 

dt 


and 


dT),d  Tj,  -  R,dt 

Given  the  pair  of  equations  (2.37),  (2.39),  an  equation  for  the  evolution  of  the  mmse  estimate  of 
z,  given  the  observations  {  y,  ,  s  t"  <  }  is  desired.  Kleinman  1969  showed  that  the  solution  of 
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this  problem  is  obtained  as  a  cascade  combination  of  a  Kalman-Bucy  filter  and  an  optimal  (least 
mea*--square)  predictor.  The  derivation  in  the  reference  is  complete  and  only  the  equations  for 
each  stage  are  presented  here. 

In  the  first  stage,  the  best  mmse  estimate  of  the  delayed  state.  given  the  observations 

up  to  time  1.  {  y,  s  ^  <  },  is  determined.  Since  dy,  Dxi^^dt  dr)f_^  define  a  new  observa¬ 
tion 

=  Vt.f  • 

Then 

=  <^y(+r  =  DXfdt  -r  dr),  .  (2.40) 

The  problem  now  assumes  a  more  standard  form  and  can  be  restated  in  more  standard  terms. 
That  is,  find  the  best  mmse  estimate  of  Xt  given  the  observations  {  y, :  s  t  ).  The  solution  to 
this  problem  is  known  to  be  given  by  the  conditional  mean, 


i,T‘*  ^  =  E(X,_,\  Z^,,) 

where  Z,  is  the  observation  tr-algebra,  <t  {  s  C  t}.  The  well  known  Kalman-Bucy  filter 

gives  the  equation  of  evolution  for  this  quantity  via  the  pair  of  equations 

di, . ,  =  {Ax,_^  ^  B«,_,)dt  ^  P,  R  ^di^,_^  (2.41) 

P,  =  AP,  ^  P,A  '  -  W  -  P,D'R-^DP,  (2.42) 

where  u,  is  the  "innovations  process"  given  by 

di/,  =  dz,  -  Cx,  dt  , 

and  P,  IS  the  covariance  matrix  of  the  error,  x,  ^  x,  -  i, .  Observe  that  the  equation  for  the 
error  co-ariance  matrix,  P,,  is  an  ordinary  differential  equation,  and  P,  itself  a  non-random  pro¬ 
cess.  This  is  a  direct  consequence  of  the  assumption  that  the  state  vector  is  composed  strictly  of 
"base"  states.  That  is,  the  disturbances  are  strictly  of  the  Oaussian  white  noise  type. 
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In  the  second  stage,  the  optimal  linear  predictor  generates  the  present  state,  z/  from  f, 

dcrording  lo 

z/  =  ^  -  fr-rj 

^  Buf  (2.43) 

v\here  i/  denotes  the  least-mean  square  prediction  of  z,  given  observations  delayed  by  r, 
{  y,  s  S  /-r}. 

This  completes  the  solution  of  the  first  problem  associated  with  the  implementation  of  Pro¬ 
position  2.1.  Notice  that  the  K-B  filter  reflects  compensation  on  the  part  of  the  operator  for  his 
perceptual  limitations  as  modeled  by  the  lumped  observation  noise,  v, .  The  optimal  predictor 
compensates  optimally  for  the  delay. 

Control  Generator: 

This  block  models  the  operator’s  generation  of  the  control.  A  "commanded"  control  is  gen¬ 
erated  by  the  equation 

u '  =  -  L  z,  .  ('^.44) 

where  L  is  the  matrix  of  optimal  gains  generated  as  the  solution  of  the  pair  of  equations  (2.35), 
(2.36).  To  eutcount  for  the  human’s  inability  to  generate  perfect  control  responses,  an  equivalent 
"motor"  noise  is  added  to  The  result  is  filtered  to  account  for  possible  bandwidth  limitations. 
The  motor  model  is  thus  given  as 

Tf/Uf  u,  =  Uf  -r  r,”  (2.45) 

where  r"  is  eissumed  to  be  Gaussian  white  noise  with  intensity  S,  and  Ty  is  the  so  called  neuro¬ 
muscular  lag  which  has  been  found  to  be  an  essentially  inherent  parameter  on  the  order  of 


.08  -  -  0.1  sec.  5,  is  defined  by 
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\\h*rf  Kleinman  1971  reports  a  typical  value  of  =  -25  dB.  The  neuro-muscular  dynamics 
reflecied  by  equation  (2.45),  can  be  incorporated  directly  into  the  dynamic  model  (2.37)  by  the 
augmentation  of  the  state  vector,  x, ,  with  u,  to  yield 


2 

0 

/  0 

du' 

uf 

dt  T 

u 

i 

1 

i 

0  — ^ 

r,. 

(2,46) 


which  has  the  same  form  as  (2.37).  Observe  it  is  the  commanded  control  u[  which  is  actually 
selected  to  minirni2e  (2  24). 


Discussion 

The  Optimal  Control  Model  provides  a  mechanism  for  describing  human  controller  response 
in  a  non- stationary  environment.  The  model  is  a  simple  parametric  one:  the  structure  is  fixed 
and  the  parameters  are  then  selected  to  shape  the  operator's  modeled  response  to  match  experi- 
menta'i  data.  In  this  sense,  the  OCM  is  not  far  from  the  de.scribmg  function  models  discussed 
above.  Recall  the  two  step  procedure  for  obtaining  the  DFM;  first,  a  describing  function  struc¬ 
ture  is  selected;  and  second,  the  parameters  of  the  model  are  chosen  to  give  the  best  match  with 
experimental  data  In  fact,  one  would  hope  that  in  a  stationary  environment  the  two  models 
would  converge  to  yield  a  single  human  oj>erator  model.  The  above  expectation,  although 
theoretically  reasonable,  may  not  be  strictly  realizable  with  the  OCM  in  its  current  form.  This  is 
because  the  OCM  is  not  necessarily  parsimonious.  Other  structures  could  exist  which  are  equally 
adept  at  matching  measured  human  response  data.  Phatak  1977  ,  for  example,  proposed  alter¬ 
nate  optimal  control  structures  which  involved  some  simplifications  to  the  standard  model. 
Examples  of  modifications  and  simplifications  used  by  him  and  others  include: 

•  elimination  of  the  perceptual  delay; 
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•  aggregation  of  the  motor  and  observation  noise; 

•  revision  of  the  terms  in  the  cost  functional; 

•  reduction  of  the  number  of  displayed  variables  the  human  is  assumed  to  perceive. 

.\n  argument  for  the  elimination  of  the  perceptual  delay  is  as  follows.  If  the  environment  is 
predictable  and  the  operator  well-trained,  he  is  able  to  compensate  for  the  delay  and  minimize  its 
effect.  The  same  function  is  accomplished  mathematically  by  the  optimal  predictor  in  the  infor¬ 
mation  processing  block  of  the  OCM.  For  this  reason  Phatak  and  others  (e.g.  Baron  and 
Le  vison  1973  )  have  eliminated  the  delay  from  the  model.  Other  researchers  jHess  (1977)  have 
chosen  instead  to  approximate  the  delay  with  a  Fade  approximation  which  is  then  incorporated 
directlc  into  the  system  dynamics.  The  elimination  of  the  predictor  from  the  model  in  either  case 
greatly  reduces  the  computational  burden  in  application.  Aggregation  of  the  motor  and  observa¬ 
tion  noise  is  similarly  justified  with  an  argument  that  the  operator  is  well-trained. 

Ir  the  OC.M  presentation  above,  a  weighting  on  control  (u,)  was  included.  This  was  the 
case  in  the  original  development,  but  in  this  early  work,  jKleinman  (1969)  ,  the  neuromuscular 
lag  was  not  included  in  the  model.  Subsequently,  the  model’s  developers,  Kleinman  et  al.  1971  . 
included  instead  a  weighting  on  control  rate.  Then, 

T 

J,  =  E  J  -f  ii\Nu,)ds  1  y,  ■  (2.47) 

■ 

This  formulation  gave  better  matches  with  experimental  data.  Additionally  it  was  noted 
that  highly  trained  pilots  rarely  make  rapid  control  movements  and  thus  the  inclusion  of  this 
term  in  the  cost  functional  is  physically  rea.sonabie.  The  result  of  this  revised  cost  functional  is  a 
control  law  of  the  form 

rvti,(f )  -  u,(0  =  -  i  (2.48) 

* 

where  u„(t)  represents  the  best  estimate  of  the  motor  noise  u„,(r),  is  a  matrix  of  optimal 


gains,  and  all  other  notation  is  as  previously  defined,  ryy.  is  functionally  related  to  the  weighting 
in  I  2.47).  The  motor  noise  is  assumed  to  be  a  (wide  band)  first  order  noise  process  generated  by 

“m(0  (2  49) 

v^her^  is  (laussian  white  noise  with  covariance  S.  At  this  point,  in  the  interest  of 

simplification,  two  assumptions  were  made.  First,  based  on  evidence  reported  to  exist  by  Klein- 

man  et  ai.  1971  ,  it  was  eissumed  that  the  bandwidths  of  u^(t)  and  were  approximately 

—  •  *  * 
equal.  Then.  7  =  1  •  Second,  it  was  assumed  that  «  L  f,  and  may  thus  be 

neglected.  This  is  a  reasonable  assumption  since  is  a  wide  band  process  and  at  any  time 

“mfO  ^  These  assumptions  yield  the  simplified  sub-optimal  control  law 

Tf^u,  ^  u,  =  -  L  i,  ^  v„{l}  ,  (2.50) 

which  "introduces"  the  neuromuscular  lag  discussed  earlier.  In  this  formulation,  the  weighting  on 
u  (i.e.  the  matrix  .V)  must  be  adjusted  to  yield  an  appropriate  value  for  r^. 

Since  the  evolution  of  the  model  outlined  above,  the  existence  of  the  contrc’  dynamics  or 
neuromuscular  lag  given  by  (2.50)  has  been  fairly  well  acknowledged-  Many  researchers  since 
Bekey  (1977),  Hess  (1977),  among  others  have  opted  to  include  the  lag  expli(.itlv  Correspond¬ 
ingly.  they  introduce  a  weighting  on  actual  control  in  the  cost  functional  (u,).  This  approach  is 
expedient  because; 

•  is  a  relatively  invariant  parameter  from  task  to  task.  Its  explicit  inclusion  thus  al¬ 
lows  it  to  be  selected  a  priori  and  then  modeled  directly  in  the  system  dvnamics. 

•  The  inclusion  of  a  weighting  on  control  rather  than  control  rate  simplifies  the  initial 
parameter  selection  process  In  this  configuration  .Vf  and  A  may  be  chosen  *0  reflect 
rriaxirnuir  allowable  deviations  in  I  fie  imp'ii  taiit  variables.  T  tiese  are  based  on  physi¬ 
cal  considerations  and  often  indirate  a  good  initial  estimate  of  the  parameters. 
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The  parameter  selection  process  is  discussed  further  in  the  applications  section  which  fol¬ 
low  b. 

Tlie  information  processor  block  of  the  OCM  produces,  in  addition  to  the  best  estimate  of 
sNstem  state,  f ^ ,  the  error  covariance  P,,  and  the  innovations  process  .  These  quantities  have 
imporiani  implications  beyond  those  discussed  above,  and  their  availability  contributes  to  the 
flexibility  of  the  OCM  approach.  For  example,  the  innovations  process  provides  a  key  quantity- 
used  in  signal  processing  to  detect  events  (such  as  system  failures).  Additionally,  z,  and  P, 
together  represent  a  sufficient  statistic  (in  the  case  of  Gaussian  white  noise  disturbances)  for 
describing  the  human's  understanding  of  the  system  state.  They  thus  provide  the  key  variables 
upon  which  decisions  about  system  operation  can  be  made.  For  example,  in  a  situation  involving 
multiple  tasks  or  a  cost  on  monitoring,  a  strategy  for  monitoring  is  required.  This  strategy 
would  be  determined  based  on  z,  and  P,.  Similarly,  consider  a  problem  in  which  the  human  has 
the  option  of  operating  in  one  of  several  modes  (for  example  in  a  computer  aided  mode  or  one  in 
w  hich  additional  sensors  or  tracking  aids  are  exploited).  Then  again,  his  choice  of  mode  should 
be  based  on  z,  and  P, .  More  basically,  in  a  simple  two-task  control  situation,  these  quantities 
determine  which  task  he  should  address.  These  ideas  are  suggested  and  expanded  upon  by  Baron 
1984  and  White  1981  among  others. 

Applications 

The  Optimal  Control  Model  has  been  used  both  for  prediction  of  human  behavior  in  known 
dynamical  systems  and  as  a  model  for  the  human  element  during  the  design  and  evaluation 
phases  of  system  development.  A  review  of  some  of  the  applications  which  verify  the  model 
structure  and  are  relevant  to  the  fximting  and  tracking  aspects  of  the  current  task,  provides  a 
perspective  on  the  status  and  utility  of  the  OCM. 

The  baseline  verification  and  validation  studies  were  performed  and  reported  by  Kleinman, 
Baron  and  Levison  1971  .  The  experiments  consisted  of  a  compensatory  tracking  task  in  which 
the  human  was  given  an  explicit  display  of  tracking  error,  e, .  It  was  assumed  that  the  operator 


could  extract  error  rate  e,,  and  e,  was  thus  included  in  the  model  display  vector  yf.  Velocity 

A  A 

( — )  and  acceleration  ( — 7)  control  w'ere  investigated.  The  results  for  both  sets  of  vehicle  dynam- 
*  s  “ 

ics  yielded  good  agreement  between  experimental  and  model  predictions  of  mean-squared  closed 
loop  performance  quantities  (e,,  e,.  and  u, )  Additionally,  plots  of  equivalent  human  describing 
functions  derived  from  the  OCM  vs.  those  measured,  and  of  computed  vs.  measured  human  rem¬ 
nant  spectra  were  in  excellent  agreement.  It  should  be  noted,  however,  that  the  OCM  parame¬ 
ters  in  this  study  were  adjusted  on  line  to  yield  the  best  matches  with  the  data.  Based  on  these 
initial  results,  it  cannot  be  concluded  that  the  OCM  is  predictive  nor  that  the  parameters  are 
independent  of  the  body  of  data.  The  significance  of  this  set  of  experiments  is  simply  that  the 
OCM  IS  capable  of  reproducing  many  aspects  of  human  response.  The  huinan  limiilalion  parame¬ 
ters  used  in  this  set  of  experiments  are  tabulated  in  Table  1 


I  t  aripes  in  aircraft  stability  derivatives  on  rrns  hovering  performance  in  a  VTOL  vehicle.  The 
i  ■ '  ( r  parameters  in  the  OC.Vl  (i.e  the  weighting  matricies  in  the  cost  functional)  were  adjusted  to 
r(  i-  (  :  ihe  nature  of  the  task.  The  pilots  were  instructed  to  minimize  position  error,  so  a  weight- 
i;  i;  v\a<  included  on  that  state  variable.  Additionally,  as  a  result  of  the  knowledge  that  well- 
ira.ned  pilots  avoid  excessive  altitude  changes,  a  weighting  was  included  on  pitch  rate.  The  rela- 
ii\e  weighting  of  the  two  is  somewhat  subjective.  In  this  case  it  was  based  on  some  existing 
experimental  data  which  reflected  the  balance  of  position  error  vs.  pitch  rate  error  in  a  nominal 
flight  configuration.  The  results  of  this  model  were  compared  to  measured  data  obtained  from  a 
wide  range  (in  terms  of  aircraft  parameters  and  disturbance  conditions)  of  simulator  experiments. 
In  most  cases,  the  model's  predicate  performance  agreed  quite  well  (within  ±  Iff)  with  the  data. 
Note  that  a  fixed  set  of  operator  parameters  was  used  for  all  conditions.  A  complete  set  of  plots 
for  the  various  air  frame  characteristics  are  presented  in  the  reference  along  with  a  discussion  of  a 
few  anomalous  points  which  occurred  a?  the  extremes  of  the  (airframe)  parameter  changes.  This 
example  Illustrates  the  OCM  parameter  selection  process,  provides  an  example  of  its  utility,  and 
supports  the  hypothesis  that  the  parameters  associated  with  human  limitations  may  be  relatively 
invariant  for  many  task  types. 

.As  another  example  in  which  the  OCM  was  applied  in  a  predictive  manner,  Kleinman  and 
Perkins  1974  used  the  OCM  in  an  anti-aircraft  (Vulcan  Air  Defense-VADS)  tracking  loop  which 
exhibited  rapidly  varying  system  dynamics.  The  pilot  was  required  to  track  both  azimuth  and 
elevation.  For  each  axis,  a  five  dimensional  state  vector  reflecting  second  order  target  motion 
and  a  third  order  gun-sight-hand  controller  model  comprised  the  system  dynamical  model.  The 
human  operator  model  was  the  OC.M  with  limitation  parameters  selected  a  prion  based  on  previ¬ 
ous  data  as  Ty  -  .1  sec,  r=  .2  ±  .05  sec.,  =  -20dB  and  =  -25dB.  The  cost  functional 
weighted  tracking  error.  Again,  the  model  predictions  of  tracking  -rror  covariances  throughout 
the  trajectories  matched  well  with  human  tracking  error  obtained  from  experiments 


rtif  OCM  tia,"  b»'en  used  in  many  other  tracking  and  weapon  systems  applies'  'ins  which  are 
n  il  (leia  leti  here  Harvey  and  Dillow  (lli74),  Kleinman  (1977)  .  The  latter  reference  presents  a 
;,ari  i<  ular  v  thorougl'  statistical  comparison  of  three  anti-tank  systems  The  OC'M  was  u^-ed  to 
g< nerate  an  ensemble  of  tracking  error  time  histones  for  ea<  h  system  and  each  of  several  target 
!  rajet  t o'ie<-.  The  model-generated  data  were  compared  against  ensembles  of  equivalent  data  lets 
obi  allied  in  field  tests.  Several  methods  of  statistical  cornparisoti  (e  g  ensemble  mean  and  sian- 
dard  deviation  analysis,  temporal  analysis,  and  for  stationary  target  trajectories,  frequency 
domain  analysis)  were  used  to  compare  the  results.  Additionally  subjective  comparisons  of  indi¬ 
vidual  runs  were  impressive.  The  values  for  the  OCM  parameter.s  used  hy  Kleinman  in  tills 
application  were  no*  reported.  He  did  comment,  however,  that  i  fi.-  only  humaii  limitatioi. 
paranieter  which  required  adjustment  from  one  tracking  system  to  another  was  the  motor  noise 
parameter  {Pm)-  This  need  is  attributed  to  the  great  variation  in  the  manipulator  characteristics 
of  the  TO\N  ,  DRAGON  and  TL\  systems. 

As  a  final  applicatioti  of  interest  in  tfie  current  discussion.  Grunwald  and  Merhav  (197(i). 
|197h)  used  the  OCM  to  model  manual  visual  field  control  of  a  (low-flying!  remotely  piloted 
vehicle  RPV  1.  The  model  was  originally  developed  to  model  the  manual  lateral  control  of  the 
Rf’V  ailing  a  nominallv  straight  refererice  path  A  five  degree  of  freedom  simulator  was  available 
tor  experimental  validation  of  the  models.  It  was  used  initially,  however,  to  investigate  the  effect 
f oriiit raints  on  the  visual  field  on  pilot  behavior.  Experiments  were  performed  in  which  the  pilot 
was  permitted  to  view  the  road  at  only  a  single  distance,  at  multiple  distances,  and  hnallv  in  an 
iincoristrairied  fasfiion.  The  data  indicated  that  his  betiavior  in  the  two  looking-distance  scenario 
closely  .matched  hi'  behavior  in  the  unconstrained  viewing  scenario  Kor  this  reason,  one-  and 
two-  looking-distance  models  were  develotied  These  were  int  orjiotai  ed  irito  the  ()('*v1  framework 
for  in  V  esi  igat  ion.  The  \FI  model  intluded  such  stai»  vdriafiles  as  laieia!  deviatioii  from  tiie 
refereiice  path,  yaw-  and  slip  angles  between  Mie  de'ired  path  and  veincle  axis,  and  velocity.  In 
the  first  stage  of  tiie  investigation  a  parametric  study  was  performed  to  determine  the  sensitivity 
of  various  model  responses  to  variations  in  moefei  parameters  Sui  fi  sensitivities  are  recjuired  it 
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the  pararneiers  are  to  be  accurately  estimated  via  experimental  procedures.  In  the  second  stage, 
the  resuming  one-  and  two-distance  models  were  compared  to  experimental  data.  The  investiga¬ 
tors  coficiuded  that  the  two-distance  \TI  model  is  a  valid  representation  of  the  visual  field  con- 
troi  task  with  unconstrained  viewing 

In  me  second  reference,  the  same  investigators  determined  the  extent  to  which  the  human 
ofjeratur  used  the  higher  order  information  in  the  stale  (e.g  lateral  velocity  and  acceleration). 
This  information  was  used  to  evaluate  the  effectiveness  of  augmenting  the  display  with  explicit 
indicators  of  such  quantities.  These  display  aids  were  evaluated  for  various  dynamical  condi¬ 
tions.  Again,  the  model  results  were  matched  with  expverimental  data  to  validate  the  model.  The 
authors  concluded  that  "the  analytical  model  proves  to  be  a  convincing  representation  of  actual 
man-machine  visual  field  control."  and  that  "this  model  proves  to  be  an  effective  research  tool  for 
the  prediction  of  system  performance  in  the  development  and  evaluation  of  display  aids."  The 
human  operator  parameter  values  determined  in  these  studies  are  summarized  in  the  reference 
and  a  complete  discussion  of  the  types  of  display  aids  considered  is  presented  there. 

The  references  discussed  above  are  just  a  few  of  the  many  which  indicate  that  the  OC.\l  can 
be  a  valuable  tool  for  the  analytical  investigation  of  the  performance  of  systems  which  utilize  a 
human  controller.  The  teleoperaled  vehicle  is  such  a  system,  but  its  nature  is  such  that  the  exist¬ 
ing  forms  of  the  OCM  do  not  result  in  an  adequate  representation  for  th  human  teleoperatcr. 
This  IS  due  to  the  unpredictable  qualities  of  the  environment,  in  concert  with  the  availability  of  a 
(visual  fiel  .  display  which  enables  the  man’s  detection  of  them.  Although  visual  field  displays 
were  used  in  some  of  the  applications  discussed  or  mentioned  above,  in  these  cases  the  ability  of 
the  human  to  extract  information  from  the  scene  which  was  not  directly  related  to  the  base  states 
was  not  relevant.  In  the  case  of  the  TO\  ,  however,  this  aspect  of  the  human’s  abilities  is  critical. 
A  mode!  based  on  the  OCV1  methodology,  but  which  accounts  for  these  higher  human  capabilities 
is  presented  in  Section  3 
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Nonlinear,  Discrete,  Sampled  Data,  Finite  State  Models 

■\  uif'  variel>  of  models  have  been  developer)  to  address  particular  types  of  human  control 
tasks,  or  sperihc  aspects  of  human  behavior  which  are  not  well-handled  by  the  standard  models 
discussed  above.  Beke\  1962  for  example,  proposed  a  sampled  data  model  for  human  response. 
The  model  essentially  used  the  DFM  to  model  the  human's  control,  but  assumed  additionally 
that  the  human's  behavior  was  characterized  by  sampling,  data  reconstruction  and  extrapolation 
operations  at  his  input.  Thus  model  was  motivated  by  evidence  in  some  research  that  human 
sampling  is  intermittent  The  introduction  of  a  sampler  explains  the  observed  presence  of  fre¬ 
quencies  at  the  human  s  output  which  are  not  in  the  input  signal.  Additionally,  the  inclusion  of  a 
first  order  hold  models  the  human's  ability  to  extrapiolate  in  the  absence  of  stimulation  (note  that 
this  ability  is  also  modeled  by  the  OCM).  This  model  structure  is  supported  by  experimental 
results  w  hen  the  input  frequency  is  high  (  >  1  cps).  but  in  general  for  lower  frequencies  a  continu¬ 
ous  model  is  adequate. 

In  some  control  systems  it  has  been  observed  that  humans  display  bang-bang  behavior. 
This  mav  be  due  to  the  nature  of  the  controller,  or  sorrie  aspect  of  the  controlled  element.  For 
example,  \oung  and  Meiry  1965  note  that  "when  the  human  operator  is  placed  in  a  control  task 
with  a  difficult  high  order  controlled  element  requiring  considerable  lead  compensation  on  his 
part  for  stable  closed-  loop  operation,  his  tracking  becomes  quite  non-linear  even  with  a  continu¬ 
ous  control  stick."  Systems  of  this  type  motivated  their  development  of  a  simple  on  off  model  for 
tiuman  behavior.  This  non-linear  model  has  proven  to  be  useful  for  certain  controlled  elements  in 
systems  in  which  minor  perturbations  are  not  important,  and  it  is  rather  the  operator's  task  to 
establish  a  limit  cycle  to  keep  the  system  within  allowable  bounds 

fini'e  state  models  have  also  tieen  proposed  to  nnKlei  hurrian  control  m  som«  tasks.  ff«‘kev 
and  .\.’igel  1968  proposed  a  model  in  which  the  op<*rat<»r  is  modeled  as  a  finite  state  machine 
•  hjcf,  switches  among  the  states  based  on  the  states  of  the  svstem  Hurnharti  and  liekev  !'<76 
used  this  general  approach  to  model  the  fuiniari  driver  in  a  single-lane,  la  following  task  The 
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states  of  the  svstem  were  quantitized  to  yield  a  four-slate  system  model.  A  decision  logic  was 
a'-''Ofiated  with  each.  Examples  of  stales  lhe>  used  are: 

•  tin  car  is  at  a  desired  velocity  but  is  too  close  icj  the  lead  car 

•  the  car  is  at  the  desired  relative  position  but  is  moving  up  on  the  lead  car. 

Details  of  the  decision  logic  are  presented  in  the  reference  This  approach  ma>  be  useful  when 
meaningful  q  janilzation  of  all  or  part  of  the  system  is  possible. 

These  are  just  a  few  of  nonlinear,  discrete  approaches  to  human  controller  modeling.  There 
are  man\  moi  .  each  developed  for  a  very  specific  reason.  Johannsen  1976,  for  example,  pro¬ 
posed  "x  model  which  incorporated  threshold  elements  and  decision  elements  to  account  for 
human  physiological  characteristics.  In  summary,  these  models  have  led  to  promising  results  for 
the  specific  tasks  or  situations  for  which  they  were  developed.  They  are  not.  however,  generally 
suited  to  or  easily  adapted  to  new  systems.  They  tend  to  be  dynamically  dependent:  a  change  in 
system  dynamics  can  result  in  a  change  in  model  structure.  The  non-linear  nature  of  the  models 
additionally  introduces  other  drawbacks  for  many  tasks.  Parameter  identification  and  closed 
loop  performance  analyses  of  the  type  to  be  performed  in  the  TOV'-ar.aly sis  program,  are  much 
more  difficult  to  accomplish  when  non-linearities  are  in  the  loop.  This  makes  the  models  more 
difficult  to  develop  and  to  use.  If  feasible,  a  standard  approach  for  which  a  well  developed  theory 
exists  is  thus  preferable  for  the  current  TOV  modeling  task. 
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3.  THE  GENERALIZED  OPERATOR  MODEL 

Tfie  dynamic  model  of  an  encounter  involving  a  TO\'  contains  se-.  -ral  interconnected  suL>- 
niodeis.  Some  of  these  have  been  discussed  in  greater  or  lesser  detail  ir‘  the  preceding  sections  of 
this  report  The  vehicle,  etc.  are  conveniently  described  by  stochastic  differential  equations.  The 
completed  encounter  model  requires  a  compatible  model  for  the  remote  operator. 

The  previous  section  reviews  several  alternative  vcays  in  whicfi  a  human  operator  has  been 
modelled  u  hen  engaged  in  tracking  and  control  tasks,  because  it  permits  the  inclusion  of  both 
time  variability  and  randomness,  the  formalism  leading  to  the  optimal  control  model  (OC.Vl)  is 
attractive.  The  classical  OG.Vl  is  a  linear  stochastic  differential  equation,  and  is  therefore,  easily 
included  v^ith  the  other  sub-models  to  form  the  full  encounter  state.  The  weighting  parameters 
in  the  performance  index  can  be  selected  to  cause  the  OC.Vl  to  mimic  the  behavior  of  an  actual 
operator. 

VN  hile  the  OCM  has  been  used  successfully  in  diverse  applications,  it  is  a  "short  time"  model 
of  human  response.  It  has  been  found  to  be  most  suitable  when  a  trained  operator  is  performing 
a  well-defined  task  in  a  familiar  environment.  His  primary  function  is  noise  reduction.  There  is 
little  opportunity  to  use  his  decision  making  capability  in  the  context  of  hi.s  assigned  task 

The  remote  operator  of  a  TOV'  must  respond  to  more  varied  stimuli  than  does  his  counter¬ 
part  assigned  a  conventional  pointing-and-tracking  task.  He  must  utilize  the  capabilities  of  the 
TOV  in  an  unpredictable  environment.  This  charge  requires  more  of  the  operator’s  ability  to 
identify  the  relevant  characteristics  of  a  time  varying  and  ambiguous  scene.  Thus,  he  is  required 
not  only  to  follow  a  target  as  it  meanders  within  his  field  of  view  ,  but  he  must  additionally  iden¬ 
tify  sudden  changes  in  target  motion,  or  any  other  events  which  infliieure  the  dynamic  structure 
of  the  encounter. 

In  this  study  the  operator  still  arts  reffexively  in  the  sense  described  earlier.  The  remote 
operator  is  assumed  to  have  a  good  understanding  cjf  the  current  scenario,  and  to  have  plan  of  the 
appropriate  actions  which  he  should  take.  Hi.s  uncertainty  about  the  current  state  of  the 
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( rn  ounier  has  two  distinct  components.  On  the  one  hand,  he  uses  his  observations  to  estimate 
itjt  stale  of  the  primary  constituents  of  the  encounter;  e.g.  the  center-line  of  the  path  to  be  fol- 
lov\ed.  the  position  and  velocity  of  the  target,  etc.  An  analogous  functional  block  is  to  be  found 
in  the  conventional  OCM. 

The  second  component  of  the  TOV’  operator  model  results  from  the  fact  that  certain 
dynamic  properties  of  the  primitives  of  the  encounter  may  change  abruptly.  Because  the  opera¬ 
tor  is  assumed  to  be  cognizant  of  the  possible  changes  which  may  take  place,  it  will  be  supposed 
that  the  mode  of  evolution  of  the  encounter  is  indicated  by  a  random  process  {r,}  with  state 
space  {l....,.V}.  The  process  {r^}  will  be  thought  of  as  delineating  the  current  status  of  the 
encounter,  and  {r, }  will  be  referred  to  as  the  feature  process. 

The  inclusion  of  a  feature  process  is  not  common  in  the  literature  on  operator  models.  If 
there  is  but  one  environment,  the  notion  of  a  feature  indicator  is  superfluous.  Alternatively,  if 
the  features  change  infrequently  and  are  sufficiently  unambiguous,  the  operator  can  be  thought  of 
as  adaptively  changing  his  own  behavior  in  concert  with  the  exogenous  process. 

This  section  considers  an  intermediate  situation  in  which  the  feature  changes  are  sufficiently 
frequent  and  equivocal  that  the  operator  must  accomplish  his  desiderata  in  the  presence  of  both 
uncertainty  in  {r, }  and  significant  modal  transients.  The  resulting  op>erator  model  is  still 
reflexive,  but  the  time  scale  of  the  human  intervention  is  extended  beyond  that  of  the  OCM.  To 
distinguish  these  models,  the  description  of  the  operator  of  the  TOV  will  be  termed  the  general¬ 
ized  operator  model  (COM). 

To  be  more  specific,  denote  by  z,  the  conventional  dynamic  state  of  the  encounter  including 
components  related  to  the  targets,  the  path  and  the  TOV.  This  portion  of  the  system  description 
will  be  called  the  base  state.  Let  4>^  be  an  N-vector  which  indicates  the  current  value  of  the 
feature  process; 

/  1  if  r,  =  . 

|0  otherwise  (^  0 

Then  the  encounter  dynamics  will  be  given  by  the  joint  dynamics  of  {i, }  and  {d>,}. 
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Define  an  augmented  encounter  state  {f, }  by 


C/ 


Oi 


(3.2) 


It  wi!)  be  assumed  that  the  dynamics  of  the  base  state  are  given  by  an  equation  of  the  form 


dx,  -  (/l2(  -  Bu,)  dt  p'dcp,  dw,  (3.3) 

where  p '  is  a  fixed  ri  .V-matrix.  The  other  variables  in  (3.3)  have  the  same  interpretation  they 
had  in  (■/  37). 

Before  continuing  it  is  well  to  review  the  implications  of  (3.3)  and  to  contrast  it  with  the 
equation  which  gives  rise  to  the  OCM  (see  (2.37)),  If  there  were  no  feature  dependence,  then 
p  =  0.  In  this  event  (3.3)  becomes  identical  to  (2.37).  .Alternatively,  if  the  features  are  unchang¬ 
ing.  dOf  =  0,  then  p'0^  =  p^  and  (3.3)  is  equivalent  to  (2.37)  with  an  additive  bias.  Such 
dynamic  structures  are  easily  accommodated  by  the  OCM. 

Eciuation  (3.3)  differs  from  (2.37)  in  a  fundamental  way  when  {4>^]  is  variable.  Suppose  the 
target  suddenly  accelerates.  This  would  be  indicated  by  a  change  in  the  component  of  {<?,} 
which  corresponds  to  target  acceleration,  i.e., 

p'di,  .  (3.4) 

The  base  state  contains  a  component  (target  acceleration)  which  experiences  an  abrupt  change. 

liie  feature  process  {0^ }  will  be  assumed  to  be  a  Markov  process  with  transition  matrix 

Q  =  : 

I  1  -  -  o(A)  ;  X  ^  ] 

Prob(r,,^=j  r,  =  ,)  =1  ^  (3  5) 

The  elements  of  Q  have  a  simple,  intuitive  interpretation.  The  mean  lifetime  in  state  i  is  9,,  '. 
The  probability  that  (r,  }  will  make  an  1  -*  j  transition  is  -  9,,  9,,.  Consequently,  the  Markov 
process  hypothesis  leads  to  a  model  whose  parameters  can  be  estimated  from  easily  discernible 
sample  function  characteristics  of  (  r, } 
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To  combint  the  dynamics  of  {2,}  and  }  into  a  suitable  model,  let  {Q,F,P)  be  the  pro- 
babilit>  space  on  which  the  exogenous  process  of  the  encounter  model  are  defined.  Let  {Fj}  be 
the  filtration  on  O.at  generated  by  { u, ,  r, }.  Then  {4>t)  can  be  described  by  the  stochastic 


differential  equation 


=  Q  0idt  t  dm, 


where  { m, }  is  a  purely  discontinuous  { }-martingale.  Equations  (3.3)  and  (3.6)  can  be  com¬ 
bined  to  form  the  external  portion  encounter  dynamic  model; 


A  pQ‘ 
0  Q- 


u,  dt 


0) 


or  more  compactly 


=  (^ff(  -  Gu,)dt  I-  F„dw,  -t-  F„dm, 


where  the  composite  factors  in  (3.8)  are  identifiable  in  (3.7).  It  is  frequently  convenient  to  write 

F  F 

matrix  relations  in  block  form  without  the  additional  comment;  e.g.,  if  F^  =  p”  ,  F^ 

‘  ^21 

would  be  interpreted  as  A  ,  {F12)  {p  Q 

Equation  (3.8)  gives  the  intrinsic  model  of  the  external  elements  of  the  encounter.  Note 
again  that  (3.8)  differs  from  (2.37)  because  of  the  discontinuous  term  {F„dm,}.  The  human 
operator  ties  the  disparate  parts  of  the  total  system  together  through  his  reaction  to  observations 
of  the  evolving  engagement.  The  observation  s^rucrure  will  be  assumed  to  have  a  decomposition 
that  conforms  to  the  state  decomposition  indicated  in  (3.2).  Denote  the  observation  vector  by 
{y, }.  Then,  it  will  be  assumed  that 


‘'J'*  =  (  0  b'  ) 


H^fdt  +  dtji 


where  b  is  an  N-vector,  (r;, }  is  a  vector  Brownian  motion,  independent  of  {tv,,  m, }.  and  with 
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intensity  /?  >  0: 

-  Rl  (3.10) 

Let  the  filtration  generated  by  {y,  }  be  denoted  by  {  }. 

The  observation  model  given  by  (3.9).  has  the  generic  form  that  was  used  in  the  conven¬ 
tional  OCM  (see  (2.40)).  Again  the  salient  difference  resides  in  the  operator’s  reaction  to  the 
feature  vector.  Equation  (3.9)  indicates  that  the  operator  observes  a  noisy  version  of  the  current 
feature  state.  Feature  t(r,  =  t)  is  represented  by  a  signal  6,  contaminated  by  an  additive,  wide¬ 
band  noise.  The  speed  with  which  the  operator  can  detect  changes  in  features  is  related  to  the 
size  of  6,  (strength  of  the  stimuli)  and  the  size  of  the  associated  element  of  R  (noise  intensity). 
These  are  parameters  which  will  be  used  to  tailor  the  model  to  the  empirical  response  characteris¬ 
tics  of  e  operator.  The  OCM  uses  no  feature  information,  and  it  is  thus  represented  by  (3.9) 
without  the  {4>i)  component. 

It  is  well  to  note  that  the  observation  model  given  by  (3.9)  is  an  intermediary  for  describing 
the  input-output  behavior  of  the  operator.  It  is  not  intended  to  describe  the  physiological 
processes  that  occur  within  the  operator.  The  ostensible  "observation"  in  (3.9)  characterizes  an 
illusory  knowledge  state  in  the  operator  which  is  generated  after  he  performs  a  stage  of  physiolog¬ 
ical  scene  processing.  Thus,  while  { y, }  is  an  intermediary  m  formulating  the  GOM.  it  is  not  an 
observation  in  a  literal  sense.  The  validity  of  (3.9)  is  based,  therefore,  on  indirect  measurements 
of  operator  resfjonse,  e.g.  delays,  false  alarm  rates  etc.,  rather  than  on  direct  measurement  of 
scene  stimuli. 

ith  the  formalism  described  in  the  previous  section,  the  explicit  description  of  the  GOM 
can  be  derived.  The  operator  is  assumed  to  act  to  minimize  a  quadratic  performance  index  7, 
given  by 

T 

'  f  (fr'^fr  +  v,'.\u,)dT  Y,  ■ 


y,  =  E 


;  A/  ^  0,  V  ..  0 


(3.11) 


In  the  previous  section,  it  was  shown  that  the  conventional  OCM  consists  of  a  cascade  combina¬ 
tion  of  two  blocks,  a  time-dependent  processor  creating  an  estimate  of  the  base  state,  and  a  gain 
block  which  creates  the  actuating  signal  therefrom.  This  latter  is  independent  of  the  peculiarities 
of  the  exogenous  disturbances.  Indeed,  as  will  be  shown,  this  block  in  the  GOM  is  identical  to 
the  corresponding  block  of  the  OCM. 

The  structure  of  the  time-dependent  processing  is  more  complicated  in  this  application.  The 
dynamic  behavior  of  the  GOM  is  governed  by  the  equations  which  dtiineate  the  first  two  condi¬ 
tional  moments  of  {f, }; 

U=E{u\Y,}  (3.12) 

P^  =  Var  {f,  I  T,} 

The  requisite  dynamics  of  the  GOM  are  the  stochastic  eqfations  which  characterize  the  evolution 
of  {;, }  and  {Pt). 

The  equations  of  the  time-dependent  processor  are  derived  in  this  section  as  a  sequence  of 
propositions.  Because  the  development  is  rather  convoluted,  the  analytical  details  are  provided 
in  the  appendix.  The  most  important  of  these  results  is  given  in  Proposition  2.  This  proposition 
provides  a  representation  of  the  GOM. 
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PROPOSITION  1.  The  exogenous  processes  {m, }  and  {u',  }  in  (3.7)  have  the  local  moments; 


E  ( dw,  F, 

}  -  0 

(3.13a) 

E  { dm,  F, 

}  =0 

(3.13b) 

(du-,)(dt),)- 

II 

(3.13c) 

E  ( dm,  dm, 

.V 

F.)  -  S 

1  1 

(3.13d) 

-  V(c>,  )dt 

Q,  <f.9.  9.  <f>,  ~  cliag  {q,  ) 

vkith  O,  the  indicator  of  mode  i.  and  7,  the  i  th  row  of  Q .  ■ 

.\s  indicated  above,  { m,  )  and  {tc,  }  are  both  {F, }  martingales.  Interestingly,  the  quadratic 

variance  of  { rn,  }  is  a  linear  function  of  {<?, }  while  the  quadratic  variance  of  {v,  }  is  constant. 

The  next  proposition  gives  a  representation  of  the  estimation  portion  of  the  operator  model 
It  is  phrased  in  terms  of  the  innovations  process  {1/,}.  Let 

du,  =  dy,  -dy,  (3.141 

Fr«m  (3  9),  this  can  be  written  as 

di^f  =  Hq,dt  -  dr},  -  H^,dt 

=  H (;,dt  dr),  (3.1.")) 

where  c,  is  the  estimation  error  prixess; 


S( 


(3  I61 


The  fundamental  result  of  this  section  is  given  next 
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i'R<  >1’C)SIT10N  2.  The  conditional  mean  process  satisfies  the  equation 

Gti,)  dt  P,H  R  ^diy,  (3.17) 

■ 

Equation  (3.17)  gives  the  explicit  form  of  the  information  processing  block  in  the  operator 
niodtl.  It  has  the  functional  form  of  a  Kalman  filler,  although  as  will  become  apparent,  there  are 
fundamental  differences  in  these  two  estimators.  These  dissimilarities  are  made  clearer  in  the 
next  proposition. 

PROPOSITION  3.  The  error  covariance  satisfies  the  stochastic  differential  equation 

dP,  =  {F.p,  -  P,F;  -  F,  nP^  +  F„  S  F„  -  P,H  R-'HP,)dt  *  dp,  (3.18) 

•  =  I 

P,  =  E 

where  { p, }  is  given  by  the  equation 

dpi  ^  ,  •••  ,  R-'di',)  (3.191 

P.  =0 

The  relationship  betw'een  the  OOM  and  the  conventional  OCM  becomes  clearer  by  reference 
to  Proposition  3.  As  mentioned  earlier,  the  time-dep>endent  processing  done  by  the  oj>erator  is 
delineated  by  (3.17).  This  equation  is  identical  for  both  operator  models,  although  the  implica¬ 
tions  are  profoundly  different.  The  OCM  uses  the  Kalman  gain  which  utilizes  a  covariance 
matrix  { P, )  given  by  the  solut  n  to 

P,  -  F.P,  *  P,F,  -  C.  HC.  P,H  R  ^HP,  (3.20) 

Equation  (3.20)  differs  from  (3.1*)  in  two  fundamental  ways,  the  first  rather  obvious  and 
the  second  more  subtle  The  discrei*-  martingale  { m,  )  has  no  relevance  to  the  OC.M.  and  hence 
makes  no  contribution  to  {P,}  in  (3.20).  If  (3  8)  is  compared  to  a  linear.  (iauss-Nlarkov. 


>^1  Dchaslic  differential  equation  as  given  in  (2.37),  the  most  obvious  difference  is  tha.  the 
Brownian  motion  term  in  (3.8)  is  augmented  by  an  orthogonal  term.  i.e. 

F^dw,  -  F^,dui  -  F^dm, 

Hence,  it  might  be  conjectured  that  the  quadratic  variance  term  in  (3.20)  should  be  replaced  bv  a 
sum  of  terms  in  (3.18); 

F{(F^dwi)(F^.dw^)'  I  Y,}  E{{F^dw){F^du  )'  ~  {F„dm,){F„dm,]'  },} 

or 

F^WF^'  ^  F^WF^  -  ^  F^  V{0,)F^ 

This  heuristic  reasoning  is  validated  by  (3.18).  The  trend  term  m  {F^)  is  given  by  an  equa¬ 
tion  that  is  identical  to  that  which  obtains  in  the  Gaussian  case.  The  only  property  of  the  feature 
process  that  influences  the  trend  in  {Pt}  is  its  quadratic  variance. 

The  forcing  term  in  (3.18)  has  no  analog  in  (3.20).  The  conditional  covariance  in  the  Gaus¬ 
sian  problem  is  independent  of  the  observation  process.  In  this  application.  Iiowever.  trie  condi¬ 
tional  covariance  {P,}  is  a  random  process,  and  hence  the  gain  in  the  estimator  given  in  (3.17)  is 
random  as  well  Furthermore,  (3,17)  and  as  a  consequence  the  GOM  are  nonlinear  functions  of 
the  \y,)  prcxess  This  again  contreists  with  both  the  CX-M  and  the  transfer  function  models 
described  earlier. 

Fiquation  (3.18)  can  be  viewed  as  a  Riccati  equation  driven  by  the  observation  prcxes.s  To 
gain  ir. sight  into  its  behavior,  suppose  that  were  conditionally  Ciaussian  If  it  has  been  a  long 
time  since  a  feature  change,  this  would  be  a  good  approximation  of  the  actual  distribution. 
Because  is  svmmetric  about  its  mean, 

CC  f,  ^0  for  all  T  (3.21  I 

Hence,  (3.18)  is  equivalent  to  (3.20)  with  the  substitution  indicated  in  (3.21) 
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Suppose,  alternatively,  that  f,  is  not  symmetric  about  its  mean.  This  will  certainly  be  the 
rase  \^hen  a  feaiur®  transition  is  suspected.  The  forcing  term  {/>,  }  will  now  cause  the  conditional 
ct>\ariance  to  change.  Interestingly,  but  not  surprisingly,  this  change  tends  to  be  toward  higher 
co\a"iances  and  ilie.nre  toward  faster  dynamics  in  the  CO.M.  To  see  this,  suppose  that  all  of  the 
svstem  variables  w ere  scalar.  Then 

CC  V, ^  ’di-,  =  p  *//*/?  -  noise 

The  asvmmetrv  in  c  is  such  that  tends  to  have  the  same  sign  as  f  with  the  result  that  { P, } 
tends  to  increase. 

hi  the  COM  proposed  here,  ife  operator  dynamic  gain,  PH  is  a  random  process 

because  { Pj  }  is  driven  by  the  innovations  prc>cess  }.  The  responsiveness  of  the  operator 
changes  as  his  evaluation  of  the  situation  changes.  Hence,  the  operator  model  is  nonlinear  as  well 
as  being  time  varying.  It  is  the  situation-dependent  aspect  of  the  model  which  makes  this  gen¬ 
eralization  so  attractive.  By  contrast  to  other  studies,  the  feature  dependence  of  the  COM  is  not 
ad  hoc,  but  is  instead  a  consequence  of  the  feature  dynamics. 

The  information  processing  portion  of  the.GOM  given  by  (3.17)-(3.18)  is  not  complete.  The 
factors  which  multiply  du,  in  (3  19)  are  conditional  third  moments  of  To  finish  the  descrip¬ 
tion  of  the  estimation  block,  the  structure  of  this  conditional  moment  must  be  displayed. 

\s  pointed  out  earlier,  if  i;  were  Gaussian  (the  conventional  OCM)  this  would  be  a  trivial  - 
f  C  c.  =  0  if  f  is  Gaussian.  Unfortunately,  in  this  application  the  evolution  of  {p,)  is  more  com¬ 
plex. 

PROPOSITION  4  Denote  the  conditional  third  moment  of  q  by  fl,;  specifically 

=  (3.22) 

Then  the  equation  for  (  P,  }  can  be  written  as 
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dP:  -  {F^P,  *  P,F:  -  Wf„  .  F^  P,H'R  ^HP,)dt  (3.23) 

.  (n(i)/y  /?  'di^, .  .  n(.v  ^  n)H  r  'du,) 


F  urt  her 


=  fiik)  ,dt  *  (H(/.ik)  -  \uH  -)R-^diy,  ;  /  =  1,  2,...,  A'  (3.24) 


where  f  z{k]  is  given  by  (A. 4. 23),  z.{l  ,k)  is  given  by  (A. 4. 29)  and 


A*/  =  (C<r'  f*)  if’ 


(3.2.S) 


Equation  (3.17),  (3.18)  and  (3.24)  give  the  dynamic  equations  of  the  (jC)\l  This  model  is 
expressed  as  the  solution  to  a  set  of  three,  coupled,  stochastic  differential  equations.  I  nfor- 
tunately,  the  system  of  equations  is  not  closed  in  the  sense  that  a  mechanism  for  evaluating 
has  not  been  presented.  Issues  arising  from  the  dependence  of  the  GOM  on  will  be  addressed  in 
the  next  section. 

The  action  blcxk  of  the  GO.M  is  a  relation  between  and  u. 


PHOPOSITION  5 


w  here 


Rgou  ~ 


A  'G 


-'i 


and  ( E,  )  is  given  by 


r.L,  -  'F^F  ^  L,GV  U.  E, 


\1 


(3.26) 


(3.27) 


(3.3(1) 


V 

--  T 


0 
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F’roposition  5  completes  the  representation  of  the  GOM.  As  was  tr,e  case  in  the  OCM,  the 
model  takes  the  form  of  a  linear  funci  on  of  the  conditional  mean  of  the  encounter  stale.  This 
"gair,"  IS  ii. dependent  of  both  the  form  and  intensity  of  the  exogenous  processes  The  dynamics 
of  tfje  GOM  are  those  of  the  estimation  process. 

Before  leaving  this  topic,  it  is  well  to  note  explicitly  the  way  in  which  the  GO.M  is  related  to 
t^  e  0(  M  It!  most  cases,  the  slate  penally  matrix  M  provides  a  weighting  on  only  the  base-state 

components. 


M  = 


V/n  0 
0  0 


F  rorn 


■  s  T  I 


(3.31) 


-n-'t 


'12 


(3.32) 


where 


(3.33) 


Under  condition  (3.31)  though 


V 

‘'12 


‘\iP 


^.2)<? 


(.4  • 


V 

‘-'11 


BS 


(3.34) 


If  Q  is  sufficiently  small,  then  will  be  small  with  the  result  ,iiat 

u,  •'  V  '/?  (3  3.M 

To  hrsl  approximation  then,  the  GOM  is  identical  with  the  0('M  The  term  U,,©,  in  13  321  is  a 
cor-ertior,  to  the  (.)CM  which  accounts  for  the  dynamics  of  the  feature  process  ' 


1 


Note  that  L|i  IS  also  dependent  on  the  properties  of  the  feat  it e  process 


4.0  SIMPLIFICATION  OF  THE  OPERATOR  MODEL 

1  'if  (lOM  is  »iveii  h'.  (.'i.lTj,  i  .iTuI  ' '1  )  fie  hrsi  three  of  these  equations  are 

suiv  'idsiic.  iTiatrix  equal  1(111'  i.'  Ia;rl\  fuf  d; iru  lisioi.  /'.  tor  example,  is  in  ari  i  \  •  ii  |  ■ 

'  \  -  r.  ;  matrix  w.^iere  n  tfa  d  i  mensioii  of  t  tie  t)as<  'tate  Higtier  moments  becottie  e\en  more 

iM.mer.iu'  To  make  tlii^-  nioiiei  uselui  m  predi  tint;  I  0\  (lerformani  e,  certain  sirnplifK  ,il  im:' 
rtiust  t>e  iT.aiie.  These  redactions  in  corni>iexit\  take  tlie  form  (jf  judicious  appr ox iii lat ions  to  r  j,* 
exact  expi'esMuns  ixfinf.  dclineatt  theCiOM. 

Trie  encounter  siate  (or.iairis  t^o  clearT  disc  errn'iie  snti-vectors;  i  tie  tiase  state  j  ano  '  t.i 
b'a'  i'e  st  at  e  O  'I  tie  ot iser  \  ai  ion  v  ei  l  m  n as  I  he  same  dei  ooi  ;,os  1 1  loi,  \  i  i  tio'  'p, ti  i  fit  !  .  h  ol  ist  ■  ,  , 
’•'I'  1'  uselui.  It:  print, pie.  li.  exaiua'itip  'tie  operator  tends  ii  deierimne  'tie  'eat  ire-s  o'  :  •,( 

(•'.(oLrder  fr<jm  an  obserxalion  of  'f'c  fe-a'ures  i  he  rtisel  v  e-s  In  tie  •-oei  dn  intio'ii  tte  tea'  ,'i- 
ori'ser  .  a’ ,  >ili  vector  tiv  y.. 

(/  y  j  'r  ■;  <1  r  •  ' '  I  ti  r 11 
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!  vc(jij*'nr!ng  of  the  operator’s  signal  processing  has  implications  with  resy>ecl  to  the  form 
•  (  j  jcitH'ti  for  {;,]  I  sing  1-1.2),  the  equation  for  \o,  }  can  be  written  directly.  This  equa- 
aSti  given  in  such  references  as  L.lliott  (19!<2)  .  To  illustrate  tne  utility  of  (3.171,  the 
•  .  '  if  ;s  derived  from  the  genera,  ecjuatioi.  for  s 

r '  t{  (  ) !  *1  )  -  1  I  1  ( I  \  (.  7  fje  d  y  namn  equat  loi.  for  ■'  o,  ]  is  gi  v  en  In 

do.  Q  0.dt  ~  iB  b,  I )  c.  H  ^  ^  (->.1  d  i‘.  (4.5) 

A  • .  e  ”  ^ 

/f  diag  1  6 1 .  b^  )  (4.6) 

6,  -  6  (p,  ^ 

Fixation  14  'll  provides  an  impcjrtant  portion  of  the  oyverator  dynamics.  It  is  a  closed  s>  s- 
'e.m  i/‘  eu.,ati(tns  in  the  sense  used  earlier,  but  noniinear  because  of  tfie  product  dependence  of  the 
seror.tt  'e'ni  on  tFie  rigtit  hand  side  of  :4..5! 

Tne  e(p,aiion  for  in.  )  can  be  w  rater,  as 

dr.  /  4  n  •  p  V  <p.  -  /fc.  d!  ■  I  .(J  r.B  H  'dt, 

I'f  en^a',  ion  for  <B  ,•  s' ill  remains  to  t>e  solved,  aithougf.  tfie  lower  right  tiand  blocs  of  ( /’,  i  is 
K  ’.ow  r  see  \  6.3  I  ' 

diagi  C'l 


5.  CONCLUSIONS  AND  DIRECTION  OF  FUTURE  EFFORT 


report  presents  the  dev  elopriierit  of  a  systent  tnodei  wfiich  will  be  used  to  stu(i\  the 
i.pera;  iori  of  a  l  OV  in  a  noili  itask  environtiient  Models  of  those  parts  of  the  encounter  tfiat  are 
ex'eriiai  to  t  fu’  remote  operator  targets.  'I'OX  .  etc  .  are  combined  to  fcjrrn  the  base  state.  A 
moUa;  indicator  vector  or  feature  state  augments  t  fie  base  state  to  fcjrm  tfie  encounter  stale. 

In  order  to  (piantifv  the  interaction  between  liie  T(i)\  and  its  surroundings,  a  suitable  model 
for  tfie  operator  is  required.  Tfie  methodology  employed  in  the  generation  of  the  conventional 
<  X '\1  IS  used  fiere  to  create  a  dynamic  structure  (tfie  CO.Ml  whose  response  mimics  tfiat  of  a 
'■enu.j'f  operator  •  r.gagei.l  ir.  guidance  and  tra'ko.g^  tai.ks  iri  an  abruptly  changing  environment. 

Ttie  (;0.\1  fias  attractive  properties  in  ifiis  application.  The  full  state  model,  containing 
botfi  the  operator  and  the  encounter  states,  is  phra.sed  in  terms  of  a  set  of  stochastic  differential 
equaiions.  Such  equations  lend  tfiernselves  'o  digital  simulation,  thus  permittuig  an  investigation 
of  specific  scenarios.  Further,  such  useful  statistical  measures  of  performance  as  the  error  covari¬ 
ance  ;  /’.  }  are  computed  as  part  of  the  analysis  procedure. 

In  the  fort fu  ornirig  period,  effort  will  be  devoted  to  completing  the  fjOM  and  to  iiivestigal- 
■ng  multitask  ajiplK  at  ions,  t.'omjtlet  ion  of  '.tie  U(>.V1  rc»quires  the  dynamic  equation  for  (  f,  [ 
The  eejuation  given  in  (3.17)  is  a  representational  result  rather  than  a  computational  recipe 
ofca.ise  t  )  is  not  provided  in  a  computationally  feasible  form.  Even  under  the  hypotheses  del- 
.nea'ed  m  section  4.  only  tfie  lower  right  fiand  tilock  is  known  explicitly. 

lo  'provide  the  requisite  equations,  some  simplification  of  the  third  moment  equation  (3.24) 
necessarv  It  is  proposed  thai  'fii‘  f‘*  done  bv  assuming  that  the  condit  ional  distribution  of 

lott,  t.dussiaii  and  niititiaussian  ('Hcis  .-X.,  ijitiuioried  earlier,  the  tiaussian  part  p'av'S  nc  cole 

I!  ].  (•  11'  o  if  were  (.aussian.  .An  analytically  tractable  approxiinaiion  tcr  the 

.n<  .au^'^ian  fiart  can  tfien  be  used  lo  create  a  term  wfiich  can  tie  used  lo  replace  (  \  }  in  (3.24). 

I  ■  s  oariitioning  of  iiPo  separate  parts  will  result  m  a  finite  dimensional  operator  model. 
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Performance  analysis  of  the  TOV  system  is  based  upon  the  GOM.  However,  the  imposition 
of  several  simultaneous  task  requirements  creates  a  distinctly  different  sort  of  dynamic  structure 
from  that  encountered  when  there  is  but  one  activity.  To  place  this  situation  within  the  model 
structure  defined  here,  it  will  be  assumed  that  the  operator  monitors  his  diverse  tasks  by  sequenc¬ 
ing  his  foveal  direction.  Specifically,  suppose  that  the  operator  must  pierform  two  antithetical 
tasks  simultaneously,  e.g  follow  an  irregular  path  while  simultaneously  tracking  an  evasive  target 
at  e  large  angular  distance  from  the  path.  It  will  be  supposed  that  the  operator  can  focus  his 


attention  in  one  of  two  directions; 


Cl  dt  ^  d  TJ, 


w  here 


/  1  if  current  foveal  direction  is  along  the  path 
~  \  2  if  current  foveal  direction  is  toward  the  target. 


The  indexing  of  the  observations  gives  another  dimension  to  the  actuating  signal;  i.e.  the 
operator's  action  is  now  the  pair  {n,  p,}  rather  than  only  {u, }  as  in  the  GOM. 

Without  further  restriction,  (5.2)  would  lead  rather  quickly  to  degeneracies  in  the  choice  of 
{p, }.  Rapid  switching  in  direction  of  observation  would  give  rise  to  performance  that  is  indistin¬ 
guishable  from  that  which  obtains  from  dual  fovea)  observations;  a  physical  impossibility  if  only 
one  operator  is  permitted.  Thjs  anomaly  can  be  avoided  if  every  1  -•  2  or  2  ->  1  transition  in 
{p, )  must  pass  through  another  state  Pt  =  ^  such  that 

H,  =  0  (5,3) 

The  rate  at  which  an  intelligent  operator  will  make  changes  in  his  direction  of  observation  is 
regulated  by  the  lifetime  in  state  3. 


This  behavior  is  made  most  apparent  if  the  properties  of  {P, }  are  explored.  Partition  {P,} 


I  ^112  I  (5-4) 

P21  I  ^22 

w hcrt  P 22  is  the  conditional  error  variance  of  the  feature  state  (see  (4.8)),  P^n  is  the  error  vari¬ 
ance  of  those  components  of  the  base  state  most  relevant  to  tsisk  1,  and  P112  the  same  quantity 
for  the  portion  of  the  base  state  relevant  to  task  2. 

The  inherent  flexibility  of  the  GOM  provides  the  means  to  quantify  the  behavior  of  the 
operator  even  in  the  presence  of  variations  in  observation  direction.  To  see  this,  note  that  the 
dynamics  ot  {  }  depend  upon  the  matrix  {Pj}  with  "faster"  response  associated  with  "greater" 

uncertainty.  To  model  the  operator’s  reaction  in  the  two-task  application,  it  could  be  supposed 
that  the  line-of-sight  direction  is  changed  whenever  the  performance  of  the  unattended  task 
degrades  to  an  unacceptable  degree. 

As  an  example  of  how  this  idea  might  be  stated  more  precisely,  denote  by  {  L’, }  the  cost  in 
the  unattended  task  and  suppose  that 

I  Tr  A/ii,  Pjj,  if  P|_  =  2 
"  I  Tr  A#,,2Pi,2  if  P,_  =  1 

where  is  a  decomposition  of  M  compatible  with  the  decomposition  of  {P, }  shown  in  (5.4). 

.Note  that  {  L’, }  need  not  be  defined  for  {p^  =  3}  since  this  is  a  transition  state. 

The  discrete  portion  of  the  operation  algorithm  can  then  be  given  by  an  inequality  of  the 

form 

dpi  ^0  if  Vf  <  fj  (5.6) 

where  i  is  a  threshold  selected  to  distinguish  acceptable  from  unacceptable  performance.  To  the 
extent  that  this  line  of  research  proves  fruitful,  the  question  then  arises  as  to  how  small  can  U  be 
chosen  in  order  that  the  operator  model  not  become  indeterminate;  i.e.  subject  to  (5  6). 


lim  Tr  M,,.  P...  >  U  .  1  -  1.  2 


15.7) 
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Switching  procedures  like  that  indicated  in  (5.6)  have  been  used  by  other  investigators  in  the 
LQG  conie.xt.  In  such  references  the  switching  sequence  can  be  precomputed  since  {P,}  is  not 
random.  In  the  case  under  study,  the  evolution  of  { P, }  is  situation  dependent,  and  the  switching 
limes  cannot  be  computed  a  pnori.  Further  the  statistical  properties  of  the  error  variance  are 
not  yet  clear!)  comprehended.  Hence,  the  best  form  of  the  switching  criterion  cannot  be  defined 
with  certainty  at  this  lime. 

Multiple  task  situations  in  which  operator  behavior  is  indeterminate  arise  when  the  work¬ 
load  is  such  that  a  single  operator  is  unable  to  perform  all  tasks  satisfactorily.  Such  a  situation 
presents  another  interesting  problem  which  can  be  investigated  from  the  present  perspective. 
One  method  of  maintaining  operator  workload  at  an  acceptable  level  involves  the  introduction  of 
automation  which  is  capable  of  performing  some  of  the  human’s  tasks.  The  multitask  model  for 
human  performance,  when  used  with  compatible  models  for  proposed  aids,  will  be  of  use  in 
predicting  the  performance  of  the  augmented  system.  Various  types  of  automation,  deployed  in 
multiple  configurations  could  be  analyzed  in  terms  of  their  utility  in  overall  system  performance 
enhancement.  Mode  switching  criteria  similar  to  those  proposed  in  the  absence  of  automation 
could  be  investigated.  Such  an  avenue  would  provide  an  interesting  application  of  the  multitask 
model  now  being  developed. 

The  work  proposed  for  the  next  year  can  be  partitioned  into  two  main  areas  of  activity. 

AREA  1.  The  GOM  developed  in  the  previous  reporting  period  must  be  completed,  simplified 
and  related  to  such  empirical  data  as  exists.  The  "completion"  of  the  GOM  has  been  discussed  in 
earlier  sections.  The  model  of  the  remote  operator  must  be  expressed  as  a  finite  dimensional  set 
of  stochastic  differential  equations.  This  will  be  accomplished  y  approximating  the  probability 
law  of  the  base  state  as  a  sum  of  two  specific  parametric  distributions. 

The  OCM  has  been  observed  to  be  rather  more  complicated  than  necessary.  The  analytical 
approach  leading  to  both  the  OCM  and  GOM  is  not  a  procedure  which  is  parsimonious  in  the 
number  of  free  parameters  in  the  model.  If  system  simulation  was  the  only  use  to  be  made  of  the 


-  81  - 


niodel,  this  wcuild  merely  be  a  nuisance.  For  a  careful  study  of  the  TOV'  system,  more  is 
required. 

To  simplify  the  COM,  the  "relevant"  portion  of  the  operator  model  must  be  distinguished 
from  the  peripheral  dynamics.  The  aim  ol  this  study  is  to  predict  system  performance,  not  to 
provide  a  highly  accurate  operator  model.  Hence,  those  aspects  of  the  COM  which  have  marginal 
influence  on  system  performance  should  be  deleted. 

The  indicated  simplification  and  reduced  parameterization  of  the  COM  is  essential  if  the 
empirical  data  from  the  test  facility  is  to  be  incorporated  into  the  final  model  determination. 
The  operator  ran  not  be  tested  apart  from  the  rest  of  the  system.  Further,  the  variables  that  del¬ 
ineate  the  (iOM  are  mathematical  artifices  Hence,  the  model  parameters  can  only  be  indirectly 
determined  from  experimental  data.  To  reduce  the  ambiguity  of  the  relationship  between  the 
model  and  the  empirical  data,  the  model  must  be  reduced  to  a  small  set  of  physically  substantive 
parameters.  Such  a  representation  makes  more  meaningful  any  sensitivity  study  based  upon  the 
COM. 


ARE.4  2.  The  system  description  must  be  expanded  to  a  multitask  environment  using  the  pro¬ 
cedures  discussed  earlier.  Both  nonrandom,  time  dependent  attention  switching  as  was  done 
with  the  OCM  -  and  situation  dependent  attention  switching  based  on  {Pi}  will  be  investigated. 
The  latter  is  clearly  the  most  apropos,  but  to  arrive  at  definitive  conclusions,  the  dynamic 
behavior  of  the  covariance  equations  must  be  better  understood. 

It  is  expected  that  the  proposed  work  will  complement  the  ongoing  experimental  activity  of 
the  TOV  test  facility.  The  modelling  activity  proposed  here  will  be  enriched  by  an  interaction 
and  technical  interchange  with  the  test  personnel  The  results  of  this  effort  will  al.so  aid  them  in 
che  development  of  test  protocols. 
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APPENDIX 

PROPOSITION  1.  Since  {m, }  and  { tc, }  are  { F, }  mariingales,  (3.13a)-(3.13b)  follow  immedi- 
atelv.  Further  {u,  }  is  Brownian  motion,  and  hence  (3.13c).  To  show  (3.13d),  observe  that  if 
{ r, }  makes  an  %  ->  j  transition  at  time  t. 


dni^dnii  = 


1  if  k  ^  I  and  k,  It  {«,  ;  } 
1  if  k  =  I  c{l,;} 

0  otherwise 


(A.1.1) 


Hence  if  =  i 


E  (dm* dm,  |  = 


-  9.*  if  /  =  » 

-  q^i  dt  if  k  =  i 

q„  dt  if  *  =  /  ?^  i 

-  qu  dt  if  it  =  /  =  I 


Let  q,  be  the  i’th  row  of  the  Q  matrix.  Then  if  =  1 

E  {dm, dm,  '  i  F,-}  =  {-<Piq,.  -  q'.4>,  '  4  diag  (9,  )  )dt 
=  Q,dt 

where  Q,  is  the  matrix  indicated  in  (AI.3).  It  then  follows  that 

N 

E  {  dm,  dm,  '  \  F,-}  ^  Q,  d>,-  ,  dt 

1  =  1 

=  V(d>,}dt 


(A.1.2) 


(A.1.3) 


(A.1.4) 


The  quantity  on  the  right  side  of  (A1.4)  is  called  d(m,  m''j,.  Note  that 

dm,  dm,  '  =  d^m  ,m  +  (dm,  dm,  '  -  d  |m ,  m  j,  ) 


=  dlm,m''i,  ■+  dm,  , 


vkhere  {m,  is  a  matrix  martingale,  i.e. 


E{{dm,,),,  (A  1.5) 

PROPOSITIO.N  2.  As  indicated  in  (3.3) 

dQ  =  Gu,)dt  ■*  martingale  increment  (.\.2.1) 

From  Krishnan  (1984),  Thm.  8.5.lj 

du  =  {F^U  -  Gv,)dt  ^  (A.2.2) 

where  it  has  been  noted  that  (ti, }  is  adapted  to  {  F^ }  and  that  {u’, }  and  {rjj}  are  independent. 

Equation  (A.2.2)  is  equivalent  to  (3.17). 

PROPOSITION  3.  The  derivation  of  (3.18)  is  as  follows.  From  (3.3)  and  (3.12)  the  error  pro¬ 
cess  satisfies 

df  =  *  f^dm  +  F^dw  -  P,H'R-Ui^,  (A.3.1) 

since  P,  =  ,  it  is  first  necessary  to  obtain  an  equation  for  f/  .  It  is  known  that 

<^(fi  fi'  )  =  )  (A,3.2) 

Expanding  (A.3.1),  and  using  the  fact  that 

(di/,)  (dt/,')  =  /?  d/  (A.3.3) 


it  follows  that 


88  - 


d;:  - 


{F.-  PH  R  'H)}}'  Ph  R-'B)'  ^  F^WF.  '  ^  F^V{4>)F„ 


+  PH  R-^HP 


dt  ~  F^  dm  j  F^  '  -r  F^  dm  f '  -  PH  '  R  ^dr,~'  4  F^dw^' 


(;dm  ^du.’  '  F^'  -  qdr)  '  R  ^  HF 


w.  fiere 


dm  j  =  dm,  dm,  '  -  V [<(>,)  dt 


Note  the  ff'  is  a  symmetric  matrix.  Let  us  consider  the  i’th  column  at  ff'; 


ff'  =  (ffi,  •  •  •  fft,  •  •  ■  ffjv) 


=  Kf?).. 


The  equation  of  evolution  of  f  f*  is  given  by  the  it’th  column  of  (A. 3. 4).  He;  ce, 

\  PH' R-^H)PU  +  {P{F,-  PH'R-^H)').,  -  {F„WF„') 

+  [F^  *  +  [PH  R-^HP)  ,\ dt  +  /'J.* 


+  (^m  dm  f'  ).*  +  ff'f*  /f'/?  ’di/, 


where  the  fact  that  (f„dmi  F„  ').*  is  a  martingale  increment  and  that 


Ei  {P  H'R-'drf^')„dr}-  I  K,  ^  =  0 


has  been  used.  Combining  the  columns  of  ffj  ,  and  noting  that  dm  f'  =  0,  it  follows  that 


dP,  =  (F^  +  PF;  -  PH'R-'HP  ^  F„wF^-  ^  F„^  V  t  „)  dt  ^  d  p,  ( 


dpt  =  (ff'  iiH'R  ^dv,  ,  ...,  H  R'dv.  ) 


where 


f’ROPOSITlON  4.  Beginning  as  in  Proposition  3,  consider  first  the  d\namir  equation  of  ifie  con- 


d  tianal  product  cc  s'*  As  indicated  previousiv 

d  (k)  ^  CC  )s,  -  s's'  (tfs*)  Cs''  )  (tf  s'*  ) 

The  term  d  cc  is  gisen  in  (A. 3. 4)  and 

s'*  -  (/’.s'J*tf^  -  '  {PH  'R  Ui^]y 

There  are  many  terms  in  expansion  (A.4.1),  and  it  is  well  to  look  at  them  individually 

(dK-  )'*  ^  (7\.  -  PH  R  -fy  f*(F,  -  PH  R-'H)',dt 

-  {F^dm  *  F^w  -  PR  R^^dn]}  },  -  V^kif'r.dm  ^  -  PH  I 

-  ~  ^  PH'R  ^HP)dt  .  F„dm,F^ 

}}'id}k)  =  }}'\if',})kdt  +  (Fmdm),  -  {F^dw),  -  [PH  R-'(H}dt  .  dr,,)), 

-}}  {(F^-  PH'  R-^H)}),dt  ^}}'(F^dm  ^F„du>  -  PH  R-^dr,) 

Finally  ,  if  we  neglect  trivial  terms 

(d  rf'  )  (d  f*)  =  {F„dm^F„  '  -  F„dm^'  +  ' F„  '  )  (F.r.dm), 

^  (F^dw^'  <;dw  F^'){F^dw), 

-  [PH  R-Ur,}  ^  }dr,'  R-^H  P)(PH  '  R  ‘drj), 

Equations  ( A.4.3j-( A.4.5)  give  us  the  basic  terms  that  we  need.  It  is.  however. 


(A.4.1) 

(A.4.2) 

■dr;)' 

(A.4.3) 

(A. 4. 4) 


(A.4.5) 

convenient 


to  simplify  some  of  the  terms  involved  before  combining  them  into  (.\.4.I  j.  Consider  the  second 
order  terms  in  (A.4.5). 
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Hence 


{F„dm  dm,  '^F^^.dm, 


=  E  ^m./ 

i.x 


El  Ffrui  F^j^  aCj  I  d ,  m  .  m  ,  ^  —  tffnjy^j 


(A.4.C) 


SimilarK 


[F^dm^F^  },j  (F„drr>),^  ^2  F^ta  dm,i^g  F^^g  ^  F^^  dm. 

a.fi  -, 

~  E  Ffrxi  a  Frnk  i  -^mj  S  1  a  3  dm  ^ 

0,3.1 


dm^ggdm.^  -  dm^dmgdm.^  -  Vggdm.^d( 


=  dm,,dm  gdm.^ 


{A.4.6) 


{F^dm^F^  ),,  )*  -  J]  F^^F^.iF„jgdm^dmgdm..^ 

0.3.1 


(A. 4.9) 


Combining  (,\.4.6)  and  (A. 4.9) 

{F„dm,F„  +  F„dm}’  +  fdmfj,,  ^  F^/f,  dm, dm, 


Fm,i  F^^^,  dm/dm  2  F„,^F^.^F„,gdm^dm  gdm. 


(A. 4. JO) 


Similarly 
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f ')„  {F,,dw),  --  j 

S  PwU  ‘i'^i  c 

S  Pwk\<i'X', 

X 

^wti  ^  u-k.  AS)  dUif  du 
i  A 


Y1  ^ wK  A  s';  /  A 

/.A 

» 


Hence 


[F^du' *  (;du' '  F^')^^(F^dw)^ 


^  P  .  F  ^  W 

■*  urt.'  '  trik  A  '' ;  '  /  A 

I  A 


^  w),  F A  Cl  ^ 


XI 


Finally 


(PH-R~Ur,}')„[PH'R-'dr,), 


^(PH'R~^)„dn^}^ 


^{PH 


T.iPH  R-')APH'R-^),,}^dr,^ 

/.A 

-j:(PH'R^^)„{PH'RAk^},  P: 

/.A 


(PH  R  ^ HP),,  }.  dt 


Consider  next  the  expectation  of  the  second  order  lerni'  ■'!'■<  * 

t  {  -  ) 


it  follows  that 


t 


A  ii\i 


f  ,  f  H  >■ 


|)ef  rie  4  o.rt’  '  <  \  ' 


(A, 4. 11) 


dt  (A. 4. 12) 


P'^)kx  dVx 

dnx 

dt 


y  ) 
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(A{i))„  dl  =  Y!i  dm,  dm^  +  dm,dmx 


i,x 


+  E  ^m.af'nktf'njpdm^dmfdm^ 
“,0,1 


(A.4.16) 


T  :en 


=  Hk)dt 


(A.4.17) 


From  the  above  it  is  possible  to  evaluate  the  third  conditional  moment.  First  note  that 

d(}}-}x)  -  PH-R-^HY'i,  +  {(F^~  PH'R-^H)}),) 

+  (F,-  PH'R-^H)}}'},  +  }{}'{F„dm  +  F^dw), 

+  +  F„dw)')  +  ((F„dm  +  F^dw)^}-t-  (F„dm  -t-  F.du;) '  j*  )  j' 

-  Tf'  PH  'R-'dT,)k  +  f*  (PH  'R-'dri) ')  -  { (PH  'R-^dr,)t }  +  (PH  'R'^n)  u)}' 


+  F„dm  dm  +  A(t)  dt  +  martingale  increment  terms 

The  expectation  of  the  right  hand  side  of  (A.4.18)  can  be  produced  as  follows.  Define 

E{}}'}x  1  rj  =  n(A) 

Then 


PH'R-^HU,  =  n(*)(Ff-  PH’R-^H) 


f  f'  ( (F,  -  PH  R-'H)  f  )*  =  E  f'f'  (F,  -  PH  'R-^H)„  }, 


=  i:n(l){F,-  PH’R-^H)„ 

t 


Similarly 

}{}'(Fmdm  -l-f.dw)*  +f*(F’„dm  F.dw)' )  -  ff' (F„dm)* 

+  fft  dm  'F„  '  =  0 


}{}'  (PH  R-Ur,),  +  ft  (PH  ’R-Ur,) ' )  =  0 


(A.4.18) 


(A.4.19) 


{A.4.20) 


(A.4.21) 

(A.4.21) 

(A.4.22) 
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p 

=  T,Vik{PH'Tt-%eRff,dt 
p 

=  lik(PH')„dt 
=  f*  (?  {HP),  ).,  dt 


Consequently 


([}}'{PH‘R-^dr,),).^dr,l,,  +({}{PH'R-^dn)'yXd^j 


*  PAPHlk,dt  +P^{PH\ 


Combining  these  terms 

d  { (ff' f*)./ , »?' )  =  {P.i{PH')k-  +  P-kiPHlr  +  {HP).,P,.  +  {HP),  P,.)dt 

Hence 


dn{k),  =  fi{k),  dt  +  [(P.,(P/f')*.  +  PAPH’)i.  +  {HP).tP,. 


-^{HP),PA  +  Auff' 


where 


(A.4.27) 


(A.4.28) 


(A.4.29) 


(A.4.30) 


(A.4.31) 
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PROPOSITION  5.  This  proposition  follows  directly  from  a  decomposition  of  costs  like  that  per- 


Hence 


|Tse  (1971)!. 

Note  that  if  r  ^  t 

Y,}  =  Y,}\ 

• 

^  eI^eu/mIi  Y,}  , 

if 

V,  } 

-  Eil'Ml  ^  TfMP,  1 

Z.  } 

T 

T 

II 

J  u/Nu,)dr  i  Y, 

t 

f  MP4r  \  Y, 

i 

. 

(A  5.1 ) 


(A.5.2) 


Equation  (A. 3. 8)  indicates  that  {P^}  is  independent  of  the  control  policy.  Consequently,  {u, } 
should  be  selected  to  minimize  the  first  term  in  (A.5.2). 

Some  care  must  be  exercised  in  the  indicated  minimization  because  the  dynamics  of  ^  are 
not  linear,  e.g.  {P,}  depends  upon  {Uf }  in  •(A.2.2).  Define 


H,  =  E 


f  Uf'M  dT\  Y, 


H(  =  { '  M  ■*  Uf'  >Vu, )  dt  dH, 

SuppK>se  that  {H,}  has  the  form 

Hf  =  E,  8, 

with  satisfying  the  usual  assumptions 

dH,  =  2},'L,d},  +  fi'E,  f,  +  Tr  E,(d  f,)  (d  c,) 

=  2f,'E,((F,f,  +  Gu,)dt  +  P,H'R  'du,)  *  s,)  dl 


(A.5.3) 


(A.5.4) 


(A.5.4) 
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+  Tr 

=  (2f'  {L,F^  +  t,)},  +  2f'E,G,u,  4  i,  +  Tr  i:,PH '  R-^HP,)  dt 

Bui  the  last  term  in  (A. 5. 6)  is  a  martingale  increment,  and  the  representation  of  {H,} 
form  developed  in  ITse,  1971  j.  The  solution  to  the  regulation  problem  is  well  known; 

where 

t,  =  -  f/  E,  -  +  H.BR-^B  'E,  -  M 

^  f  ~  0 


PROPOSITION  6.  Specializing  (A. 2. 2)  to  the  }  subsystem,  it  follows  that 

ddi  =  Q  <t>  dt  ■¥  P^bR^^  du^ 

where 


dy^  ~  b  d  dt 


But 


Define 


Then 


P^  •  E  {dd'  '  y^,i  }  -  dd' 

=  diag  {0,,  •  •  ■  ,  ds)  -  d^' 


B  »  diag  (6,,  •  ,bf^) 

b  •  d'b 


=*  Q  '  dfdt  +  (B,  -  b,l)  d,  R^  '  di/^ 


(A. 5. 6) 
is  in  the 


{A.5.7) 


{A.5.8) 


(A.6.1) 


(A.6.2) 


(A. 6.3) 


(A.6.4) 


d  dt 


(A.6.5) 


